Main Page

Table of Contents

Author Index

Table of Contents

Welcome Message from the Chairs (Page iii)
Ulle Endriss (AAMAS-2021 Programme Chair)

Ann Nowé (AAMAS-2021 Programme Chair)

Frank Dignum (AAMAS-2021 General Chair)

Alessio Lomuscio (AAMAS-2021 General Chair)

Conference Organisation (Page xxv)

Area Chairs (Page xxvi)

Senior Programme Committee (Page xxvii)

Programme Committee (Page xxviii)

Auxiliary Reviewers (Page xxxiii)

Special Track Reviewers (Page xxxv)

Awards (Page xxxvii)

Sponsors & Supporters (Page xxxix)

Blue Sky Ideas Track

Extended Abstracts

Demonstration Track

Main Track


Doctoral Consortium

Blue Sky Ideas Track

Broadening the Research Agenda for Computational Social Choice: Multiple Preference Profiles and Multiple Solutions (Page 1)
Niclas Boehmer (TU Berlin)
Rolf Niedermeier (TU Berlin)

Models We Can Trust: Toward a Systematic Discipline of (Agent-Based) Model Interpretation and Validation (Page 6)
Gabriel Istrate (West University of Timişoara & the e-Austria Research Institute)

Cognitive Homeostatic Agents (Page 12)
Amol Kelkar (

Multi-modal Agents for Business Intelligence (Page 17)
Jeffrey O. Kephart (IBM Thomas J. Research Center)

Environment Shift Games: Are Multiple Agents the Solution, and not the Problem, to Non-Stationarity? (Page 23)
Alexander Mey (Delft University of Technology)
Frans A. Oliehoek (Delft University of Technology)

The Seeing-Eye Robot Grand Challenge: Rethinking Automated Care (Page 28)
Reuth Mirsky (The University of Texas at Austin)
Peter Stone (The University of Texas at Austin)

Sparse Training Theory for Scalable and Efficient Agents (Page 34)
Decebal Constantin Mocanu (University of Twente)
Elena Mocanu (University of Twente)
Tiago Pinto (Polytechnic Institute of Porto)
Selima Curci (Eindhoven University of Technology)
Phuong H. Nguyen (Eindhoven University of Technology)
Madeleine Gibescu (Utrecht University)
Damien Ernst (University of Liège)
Zita A. Vale (Polytechnic Institute of Porto)

Autonomous Agents and Multiagent Systems Challenges in Earth Observation Satellite Constellations (Page 39)
Gauthier Picard (ONERA DTIS, Université de Toulouse)
Clément Caron (Airbus Defence and Space)
Jean-Loup Farges (ONERA DTIS, Université de Toulouse)
Jonathan Guerra (Airbus Defence and Space)
Cédric Pralet (ONERA DTIS, Université de Toulouse)
Stéphanie Roussel (ONERA DTIS, Université de Toulouse)

Better Metrics for Evaluating Explainable Artificial Intelligence (Page 45)
Avi Rosenfeld (Jerusalem College of Technology)

Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems (Page 51)
Yaodong Yang (University College London & Huawei R&D U.K.)
Jun Luo (Huawei Canada)
Ying Wen (Shanghai Jiao Tong University)
Oliver Slumbers (University College London)
Daniel Graves (Huawei Canada)
Haitham Bou Ammar (Huawei R&D U.K.)
Jun Wang (University College London & Huawei R&D U.K.)
Matthew E. Taylor (University of Alberta & Alberta Machine Intelligence Institute)

Responsibility Research for Trustworthy Autonomous Systems (Page 57)
Vahid Yazdanpanah (University of Southampton)
Enrico H. Gerding (University of Southampton)
Sebastian Stein (University of Southampton)
Mehdi Dastani (Utrecht University)
Catholijn M. Jonker (Delft University of Technology)
Timothy J. Norman (University of Southampton)

Mechanism Design Powered by Social Interactions (Page 63)
Dengji Zhao (ShanghaiTech University)

(Return to Top)

Main Track

Reason Explanation for Encouraging Behaviour Change Intention (Page 68)
Amal Abdulrahman (Macquarie University)
Deborah Richards (Macquarie University)
Ayse Aysin Bilgin (Macquarie University)

Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games (Page 78)
Kenshi Abe (CyberAgent, Inc.)
Yusuke Kaneko (CyberAgent, Inc.)

Siting and Sizing of Charging Infrastructure for Shared Autonomous Electric Fleets (Page 88)
Ramin Ahadi (University of Cologne)
Wolfgang Ketter (University of Cologne)
John Collins (University of Minnesota)
Nicolò Daina (Imperial College London)

Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection (Page 97)
Lucas N. Alegre (Federal University of Rio Grande do Sul)
Ana L. C. Bazzan (Federal University of Rio Grande do Sul)
Bruno C. da Silva (University of Massachusetts, Amherst)

Interrogating the Black Box: Transparency through Information-Seeking Dialogues (Page 106)
Andrea Aler Tubella (Umeå University)
Andreas Theodorou (Umeå University)
Juan Carlos Nieves (Umeå University)

Cooperation and Reputation Dynamics with Reinforcement Learning (Page 115)
Nicolas Anastassacos (University College London)
Julian García (Monash University)
Stephen Hailes (University College London)
Mirco Musolesi (University College London & University of Bologna)

State-Aware Variational Thompson Sampling for Deep Q-Networks (Page 124)
Siddharth Aravindan (National University of Singapore)
Wee Sun Lee (National University of Singapore)

Multi-Robot Task Allocation-Complexity and Approximation (Page 133)
Haris Aziz (University of New South Wales Sydney)
Hau Chan (University of Nebraska-Lincoln)
Ágnes Cseh (University of Potsdam)
Bo Li (The Hong Kong Polytechnic University)
Fahimeh Ramezani (University of New South Wales Sydney)
Chenhao Wang (University of Nebraska-Lincoln)

Robustness Based on Accountability in Multiagent Organizations (Page 142)
Matteo Baldoni (Università di Torino)
Cristina Baroglio (Università di Torino)
Roberto Micalizio (Università di Torino)
Stefano Tedeschi (Università di Torino)

Predicting Voting Outcomes in Presence of Communities (Page 151)
Jacques Bara (University of Warwick)
Omer Lev (Ben-Gurion University)
Paolo Turrini (University of Warwick)

Cooperative Prioritized Sweeping (Page 160)
Eugenio Bargiacchi (Vrije Universiteit Brussel)
Timothy Verstraeten (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel & HU University of Applied Sciences Utrecht)

Existence and Computation of Maximin Fair Allocations Under Matroid-Rank Valuations (Page 169)
Siddharth Barman (Indian Institute of Science)
Paritosh Verma (Indian Institute of Science)

Complexity of Scheduling and Predicting Round-Robin Tournaments (Page 178)
Dorothea Baumeister (Heinrich-Heine-Universität Düsseldorf)
Tobias Alexander Hogrebe (Heinrich-Heine-Universität Düsseldorf)

Complexity of Sequential Rules in Judgment Aggregation (Page 187)
Dorothea Baumeister (Heinrich-Heine-Universität Düsseldorf)
Linus Boes (Heinrich-Heine-Universität Düsseldorf)
Robin Weishaupt (Heinrich-Heine-Universität Düsseldorf)

Optimising Long-Term Outcomes using Real-World Fluent Objectives: An Application to Football (Page 196)
Ryan Beal (University of Southampton)
Georgios Chalkiadakis (Technical University of Crete)
Timothy J. Norman (University of Southampton)
Sarvapali D. Ramchurn (University of Southampton)

Action Priors for Large Action Spaces in Robotics (Page 205)
Ondrej Biza (Northeastern University)
Dian Wang (Northeastern University)
Robert Platt (Northeastern University)
Jan-Willem van de Meent (Northeastern University)
Lawson L.S. Wong (Northeastern University)

Egalitarian Judgment Aggregation (Page 214)
Sirin Botan (University of Amsterdam)
Ronald de Haan (University of Amsterdam)
Marija Slavkovik (University of Bergen)
Zoi Terzopoulou (University of Amsterdam)

Manipulability of Thiele Methods on Party-List Profiles (Page 223)
Sirin Botan (University of Amsterdam)

Decision Model for a Virtual Agent that can Touch and be Touched (Page 232)
Fabien Boucaud (Université de Technologie de Compiègne, CNRS)
Catherine Pelachaud (Sorbonne Université, CNRS, ISIR)
Indira Thouvenin (Université de Technologie de Compiègne, CNRS)

Knowledge Improvement and Diversity under Interaction-Driven Adaptation of Learned Ontologies (Page 242)
Yasser Bourahla (University Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG)
Manuel Atencia (University Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG)
Jérôme Euzenat (University Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG)

On the Indecisiveness of Kelly-Strategyproof Social Choice Functions (Page 251)
Felix Brandt (Technische Universität München)
Martin Bullinger (Technische Universität München)
Patrick Lederer (Technische Universität München)

High-Multiplicity Fair Allocation Made More Practical (Page 260)
Robert Bredereck (Humboldt-Universität zu Berlin)
Aleksander Figiel (TU Berlin)
Andrzej Kaczmarczyk (TU Berlin)
Dušan Knop (Czech Technical University in Prague)
Rolf Niedermeier (TU Berlin)

Multi-Agent Coordination in Adversarial Environments through Signal Mediated Strategies (Page 269)
Federico Cacciamani (Politecnico di Milano)
Andrea Celli (Politecnico di Milano)
Marco Ciccone (Politecnico di Milano)
Nicola Gatti (Politecnico di Milano)

Imitation Learning from Pixel-Level Demonstrations by HashReward (Page 279)
Xin-Qiang Cai (Nanjing University)
Yao-Xiang Ding (Nanjing University)
Yuan Jiang (Nanjing University)
Zhi-Hua Zhou (Nanjing University)

Worst-case Bounds for Spending a Common Budget (Page 288)
Pierre Cardi (Université Paris-Dauphine & Université PSL, CNRS, LAMSADE)
Laurent Gourvès (Université Paris-Dauphine & Université PSL, CNRS, LAMSADE)
Julien Lesca (Université Paris-Dauphine & Université PSL, CNRS, LAMSADE)

Classifying the Complexity of the Possible Winner Problem on Partial Chains (Page 297)
Vishal Chakraborty (University of California, Santa Cruz)
Phokion G. Kolaitis (University of California, Santa Cruz & IBM Research)

Tractable Mechanisms for Computing Near-Optimal Utility Functions (Page 306)
Rahul Chandan (University of California, Santa Barbara)
Dario Paccagnan (Imperial College London)
Jason R. Marden (University of California, Santa Barbara)

Temporal Watermarks for Deep Reinforcement Learning Models (Page 314)
Kangjie Chen (Nanyang Technological University)
Shangwei Guo (Nanyang Technological University)
Tianwei Zhang (Nanyang Technological University)
Shuxin Li (Nanyang Technological University)
Yang Liu (Nanyang Technological University)

A Game Theoretical Analysis of Non-Linear Blockchain System (Page 323)
Lin Chen (Texas Tech University)
Lei Xu (University of Texas Rio Grande Valley)
Zhimin Gao (Auburn University at Montgomery)
Ahmed Imtiaz Sunny (Texas Tech University)
Keshav Kasichainula (University of Houston)
Weidong Shi (University of Houston)

A General Trust Framework for Multi-Agent Systems (Page 332)
Mingxi Cheng (University of Southern California)
Chenzhong Yin (University of Southern California)
Junyao Zhang (University of Southern California)
Shahin Nazarian (University of Southern California)
Jyotirmoy Deshmukh (University of Southern California)
Paul Bogdan (University of Southern California)

Scalable Anytime Planning for Multi-Agent MDPs (Page 341)
Shushman Choudhury (Stanford University)
Jayesh K. Gupta (Stanford University)
Peter Morales (Microsoft)
Mykel J. Kochenderfer (Stanford University)

MOBLOT: Molecular Oblivious Robots (Page 350)
Serafino Cicerone (University of L'Aquila)
Alessia Di Fonso (University of L'Aquila)
Gabriele Di Stefano (University of L'Aquila)
Alfredo Navarra (University of Perugia)

Spatial Consensus-Prevention in Robotic Swarms (Page 359)
Saar Cohen (Bar Ilan University)
Noa Agmon (Bar Ilan University)

Rational Synthesis in the Commons with Careless and Careful Agents (Page 368)
Rodica Condurache (A. I. Cuza University of Iaşi)
Catalin Dima (LACL - Université Paris-Est Créteil)
Youssouf Oualhadj (LACL - Université Paris-Est Créteil)
Nicolas Troquard (KRDB - Free University of Bozen-Bolzano)

Loss Bounds for Approximate Influence-Based Abstraction (Page 377)
Elena Congeduti (Delft University of Technology)
Alexander Mey (Delft University of Technology)
Frans A. Oliehoek (Delft University of Technology)

Scalable Multiagent Driving Policies for Reducing Traffic Congestion (Page 386)
Jiaxun Cui (University of Texas at Austin)
William Macke (University of Texas at Austin)
Harel Yedidsion (University of Texas at Austin)
Aastha Goyal (University of Texas at Austin)
Daniel Urieli (General Motors R&D Labs)
Peter Stone (University of Texas at Austin & Sony AI)

Improved Cooperation by Exploiting a Common Signal (Page 395)
Panayiotis Danassis (École Polytechnique Fédérale de Lausanne (EPFL))
Zeki Doruk Erden (École Polytechnique Fédérale de Lausanne (EPFL))
Boi Faltings (École Polytechnique Fédérale de Lausanne (EPFL))

A Heuristic Algorithm for Multi-Agent Vehicle Routing with Automated Negotiation (Page 404)
Dave de Jonge (IIIA-CSIC)
Filippo Bistaffa (IIIA-CSIC)
Jordi Levy (IIIA-CSIC)

Walrasian Equilibria in Markets with Small Demands (Page 413)
Argyrios Deligkas (Royal Holloway University of London)
Themistoklis Melissourgos (Technical University of Munich)
Paul G. Spirakis (University of Liverpool & University of Patras)

Modeling Replicator Dynamics in Stochastic Games Using Markov Chain Method (Page 420)
Chuang Deng (Shanghai Jiao Tong University)
Zhihai Rong (University of Electronic Science and Technology of China)
Lin Wang (Shanghai Jiao Tong University)
Xiaofan Wang (Shanghai University)

Explaining BDI Agent Behaviour through Dialogue (Page 429)
Louise A. Dennis (University of Manchester)
Nir Oren (University of Aberdeen)

Network Robustness via Global k-cores (Page 438)
Palash Dey (Indian Institute of Technology, Kharagpur)
Suman Kalyan Maity (Northwestern University)
Sourav Medya (Northwestern University)
Arlei Silva (University of California, Santa Barbara)

Efficient Nonmyopic Online Allocation of Scarce Reusable Resources (Page 447)
Zehao Dong (Washington University in St. Louis)
Sanmay Das (George Mason University)
Patrick Fowler (Washington University in St. Louis)
Chien-Ju Ho (Washington University in St. Louis)

Learning Correlated Communication Topology in Multi-Agent Reinforcement learning (Page 456)
Yali Du (University College London)
Bo Liu (Institute of Automation, Chinese Academy of Sciences)
Vincent Moens (Huawei R&D UK)
Ziqi Liu (University College London)
Zhicheng Ren (University of California, Los Angeles)
Jun Wang (University College London)
Xu Chen (Renmin University of China)
Haifeng Zhang (Institute of Automation, Chinese Academy of Sciences)

Log-time Prediction Markets for Interval Securities (Page 465)
Miroslav Dudík (Microsoft Research)
Xintong Wang (University of Michigan)
David M. Pennock (Rutgers University)
David M. Rothschild (Microsoft Research)

An Abstraction-based Method to Check Multi-Agent Deep Reinforcement-Learning Behaviors (Page 474)
Pierre El Mqirmi (Imperial College London)
Francesco Belardinelli (Imperial College London)
Borja G. León (Imperial College London)

Safe Multi-Agent Reinforcement Learning via Shielding (Page 483)
Ingy ElSayed-Aly (University of Virginia)
Suda Bharadwaj (University of Texas at Austin)
Christopher Amato (Northeastern University)
Rüdiger Ehlers (Clausthal University of Technology)
Ufuk Topcu (University of Texas at Austin)
Lu Feng (University of Virginia)

A Knowledge Compilation Map for Conditional Preference Statements-based Languages (Page 492)
Hélène Fargier (IRIT-CNRS, Université de Toulouse)
Jérôme Mengin (IRIT-CNRS, Université de Toulouse)

Self-Imitation Advantage Learning (Page 501)
Johan Ferret (Google Research & Inria Lille Nord Europe)
Olivier Pietquin (Google Research)
Matthieu Geist (Google Research)

Strategyproof Facility Location Mechanisms on Discrete Trees (Page 510)
Alina Filimonov (Technion - Israel Institute of Technology)
Reshef Meir (Technion - Israel Institute of Technology)

Probabilistic Control Argumentation Frameworks (Page 519)
Fabrice Gaignier (University of Paris)
Yannis Dimopoulos (University of Cyprus)
Jean-Guy Mailly (University of Paris)
Pavlos Moraitis (University of Paris and Argument Theory)

Quantified Announcements and Common Knowledge (Page 528)
Rustam Galimullin (University of Bergen)
Thomas Ågotnes (University of Bergen & Southwest University)

Partially Observable Mean Field Reinforcement Learning (Page 537)
Sriram Ganapathi Subramanian (University of Waterloo)
Matthew E. Taylor (University of Alberta & Alberta Machine Intelligence Institute (Amii))
Mark Crowley (University of Waterloo)
Pascal Poupart (University of Waterloo & Vector Institute)

On a Notion of Monotonic Support for Bipolar Argumentation Frameworks (Page 546)
Anis Gargouri (University Artois, CNRS, UMR 8188, CRIL)
Sébastien Konieczny (CNRS, University Artois, UMR 8188, CRIL)
Pierre Marquis (University Artois, CNRS, IUF, UMR 8188, CRIL)
Srdjan Vesic (CNRS, University Artois, UMR 8188, CRIL)

Multivariate Analysis of Scheduling Fair Competitions (Page 555)
Siddharth Gupta (Ben-Gurion University of the Negev)
Meirav Zehavi (Ben-Gurion University of the Negev)

Action Selection for Composable Modular Deep Reinforcement Learning (Page 565)
Vaibhav Gupta (International Institute of Information Technology, Hyderabad)
Daksh Anand (International Institute of Information Technology, Hyderabad)
Praveen Paruchuri (International Institute of Information Technology, Hyderabad)
Akshat Kumar (Singapore Management University)

Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice (Page 574)
Lewis Hammond (University of Oxford)
James Fox (University of Oxford)
Tom Everitt (DeepMind)
Alessandro Abate (University of Oxford)
Michael Wooldridge (University of Oxford)

Multi-Agent Reinforcement Learning with Temporal Logic Specifications (Page 583)
Lewis Hammond (University of Oxford)
Alessandro Abate (University of Oxford)
Julian Gutierrez (Monash University)
Michael Wooldridge (University of Oxford)

A Hotelling-Downs Framework for Party Nominees (Page 593)
Paul Harrenstein (University of Oxford)
Grzegorz Lisowski (University of Warwick)
Ramanujan Sridharan (University of Warwick)
Paolo Turrini (University of Warwick)

Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards (Page 602)
Keyang He (University of Georgia)
Bikramjit Banerjee (University of Southern Mississippi)
Prashant Doshi (University of Georgia)

Learning Node-Selection Strategies in Bounded-Suboptimal Conflict-Based Search for Multi-Agent Path Finding (Page 611)
Taoan Huang (University of Southern California)
Bistra Dilkina (University of Southern California)
Sven Koenig (University of Southern California)

Show Me the Way: Intrinsic Motivation from Demonstrations (Page 620)
Léonard Hussenot (Google Research, Brain Team & Univ. Lille, CNRS, Inria Scool, UMR 9189 CRIStAL)
Robert Dadashi (Google Research, Brain Team)
Matthieu Geist (Google Research, Brain Team)
Olivier Pietquin (Google Research, Brain Team)

Action Advising with Advice Imitation in Deep Reinforcement Learning (Page 629)
Ercüment İlhan (Queen Mary University of London)
Jeremy Gow (Queen Mary University of London)
Diego Perez-Liebana (Queen Mary University of London)

Computing the Extremal Possible Ranks with Incomplete Preferences (Page 638)
Aviram Imber (Technion - Israel Institute of Technology)
Benny Kimelfeld (Technion - Israel Institute of Technology)

Probabilistic Inference of Winners in Elections by Independent Random Voters (Page 647)
Aviram Imber (Technion - Israel Institute of Technology)
Benny Kimelfeld (Technion - Israel Institute of Technology)

Trader-Company Method: A Metaheuristics for Interpretable Stock Price Prediction (Page 656)
Katsuya Ito (Preferred Networks, Inc.)
Kentaro Minami (Preferred Networks, Inc.)
Kentaro Imajo (Preferred Networks, Inc.)
Kei Nakagawa (Nomura Asset Management Co., Ltd.)

Partition Aggregation for Participatory Budgeting (Page 665)
Pallavi Jain (Indian Institute of Technology Jodhpur)
Nimrod Talmon (Ben-Gurion University of the Negev)
Laurent Bulteau (LIGM, CNRS, Univ Gustave Eiffel)

Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning (Page 674)
Zhengyao Jiang (University College London)
Pasquale Minervini (University College London)
Minqi Jiang (University College London)
Tim Rocktäschel (University College London)

Committee Selection using Attribute Approvals (Page 683)
Venkateswara Rao Kagita (National Institute of Technology)
Arun K. Pujari (Mahindra University)
Vineet Padmanabhan (University of Hyderabad)
Haris Aziz (University of New South Wales Sydney)
Vikas Kumar (University of Delhi & Central University of Rajasthan)

Mechanism Design for Housing Markets over Social Networks (Page 692)
Takehiro Kawasaki (Kyushu University)
Ryoji Wada (Kyushu University)
Taiki Todo (Kyushu University)
Makoto Yokoo (Kyushu University)

Knowing Why – On the Dynamics of Knowledge about Actual Causes in the Situation Calculus (Page 701)
Shakil M. Khan (Ronin Institute)
Yves Lespérance (York University)

Beyond "To Act or Not to Act": Fast Lagrangian Approaches to General Multi-Action Restless Bandits (Page 710)
Jackson A. Killian (Harvard University)
Andrew Perrault (Harvard University)
Milind Tambe (Harvard University)

Feasible Coalition Sequences (Page 719)
Tabajara Krausburg (Pontifical Catholic University of Rio Grande do Sul & Clausthal University of Technology)
Jürgen Dix (Clausthal University of Technology)
Rafael H. Bordini (Pontifical Catholic University of Rio Grande do Sul)

Adaptive Operating Hours for Improved Performance of Taxi Fleets (Page 728)
Rajiv Ranjan Kumar (Singapore Management University)
Pradeep Varakantham (Singapore Management University)
Shih-Fen Cheng (Singapore Management University)

Approval-Based Shortlisting (Page 737)
Martin Lackner (TU Wien)
Jan Maly (TU Wien)

Aggregating Bipolar Opinions (Page 746)
Stefan Lauren (Imperial College London)
Francesco Belardinelli (Imperial College London & Universite d'Evry)
Francesca Toni (Imperial College London)

The Price is (Probably) Right: Learning Market Equilibria from Samples (Page 755)
Omer Lev (Ben-Gurion University of the Negev)
Neel Patel (University of Southern California)
Vignesh Viswanathan (University of Massachusetts, Amherst)
Yair Zick (University of Massachusetts, Amherst)

Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning (Page 764)
Sheng Li (Stanford University)
Jayesh K. Gupta (Stanford University)
Peter Morales (Microsoft)
Ross Allen (Massachusetts Institute of Technology)
Mykel J. Kochenderfer (Stanford University)

Structured Diversification Emergence via Reinforced Organization Control and Hierachical Consensus Learning (Page 773)
Wenhao Li (East China Normal University)
Xiangfeng Wang (East China Normal University & SRIAS)
Bo Jin (East China Normal University & SRIAS)
Junjie Sheng (East China Normal University)
Yun Hua (East China Normal University)
Hongyuan Zha (RIAS & The Chinese University of Hong Kong)

Parallel Curriculum Experience Replay in Distributed Reinforcement Learning (Page 782)
Yuyu Li (University of Science and Technology of China)
Jianmin Ji (University of Science and Technology of China)

Let the DOCTOR Decide Whom to Test: Adaptive Testing Strategies to Tackle the COVID-19 Pandemic (Page 790)
Yu Liang (Penn State University)
Amulya Yadav (Penn State University)

Axies: Identifying and Evaluating Context-Specific Values (Page 799)
Enrico Liscio (Delft University of Technology)
Michiel van der Meer (Leiden University)
Luciano C. Siebert (Delft University of Technology)
Catholijn M. Jonker (Delft University of Technology)
Niek Mouter (Delft University of Technology)
Pradeep K. Murukannaiah (Delft University of Technology)

Energy-Based Imitation Learning (Page 809)
Minghuan Liu (Shanghai Jiao Tong University)
Tairan He (Shanghai Jiao Tong University)
Minkai Xu (Shanghai Jiao Tong University)
Weinan Zhang (Shanghai Jiao Tong University)

Deceptive Reinforcement Learning for Privacy-Preserving Planning (Page 818)
Zhengshang Liu (The University of Melbourne)
Yue Yang (The University of Melbourne)
Tim Miller (The University of Melbourne)
Peta Masters (The University of Melbourne)

A Logic of Evaluation (Page 827)
Emiliano Lorini (IRIT-CNRS, Toulouse University)

Exploration of Indoor Environments through Predicting the Layout of Partially Observed Rooms (Page 836)
Matteo Luperto (Università degli Studi di Milano)
Luca Fochetta (Politecnico di Milano)
Francesco Amigoni (Politecnico di Milano)

Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning (Page 844)
Xueguang Lyu (Northeastern University)
Yuchen Xiao (Northeastern University)
Brett Daley (Northeastern University)
Christopher Amato (Northeastern University)

Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning (Page 853)
Xiaoteng Ma (Tsinghua University)
Yiqin Yang (Tsinghua University)
Chenghao Li (Tsinghua University)
Yiwen Lu (Tsinghua University)
Qianchuan Zhao (Tsinghua University)
Jun Yang (Tsinghua University)

To hold or not to hold? - Reducing Passenger Missed Connections in Airlines using Reinforcement Learning (Page 862)
Tejasvi Malladi (TCS Research)
Karpagam Murugappan (TCS Research)
Depak Sudarsanam (TCS Research)
Ramasubramanian Suriyanarayanan (TCS Research)
Arunchandar Vasan (TCS Research)

(Return to Top)

Extended Goal Recognition: A Planning-Based Model for Strategic Deception (Page 871)
Peta Masters (The University of Melbourne)
Michael Kirley (The University of Melbourne)
Wally Smith (The University of Melbourne)

Risk-Aware Interventions in Public Health: Planning with Restless Multi-Armed Bandits (Page 880)
Aditya Mate (Harvard University)
Andrew Perrault (Harvard University)
Milind Tambe (Harvard University)

Identification of Unexpected Decisions in Partially Observable Monte-Carlo Planning: A Rule-Based Approach (Page 889)
Giulio Mazzi (Università degli Studi di Verona)
Alberto Castellini (Università degli Studi di Verona)
Alessandro Farinelli (Università degli Studi di Verona)

Cooperation between Independent Reinforcement Learners under Wealth Inequality and Collective Risks (Page 898)
Ramona Merhej (Universidade de Lisboa & Sorbonne University)
Fernando P. Santos (Princeton University & University of Amsterdam)
Francisco S. Melo (Universidade de Lisboa)
Francisco C. Santos (Universidade de Lisboa)

Value-Guided Synthesis of Parametric Normative Systems (Page 907)
Nieves Montes (Artificial Intelligence Research Institute (IIIA-CSIC))
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))

ELVIRA: An Explainable Agent for Value and Utility-Driven Multiuser Privacy (Page 916)
Francesca Mosca (King's College London)
Jose M. Such (King's College London)

A Novelty-Centric Agent Architecture for Changing Worlds (Page 925)
Faizan Muhammad (Tufts University)
Vasanth Sarathy (Tufts University)
Gyan Tatiya (Tufts University)
Shivam Goel (Tufts University)
Saurav Gyawali (Tufts University)
Mateo Guaman (Tufts University)
Jivko Sinapov (Tufts University)
Matthias Scheutz (Tufts University)

Reward Machines for Cooperative Multi-Agent Reinforcement Learning (Page 934)
Cyrus Neary (The University of Texas at Austin)
Zhe Xu (Arizona State University)
Bo Wu (The University of Texas at Austin)
Ufuk Topcu (The University of Texas at Austin)

Adversarial Learning in Revenue-Maximizing Auctions (Page 955)
Thomas Nedelec (Criteo AI Lab & ENS Paris Saclay)
Jules Baudet (Ecole Polytechnique)
Vianney Perchet (ENSAE & Criteo AI Lab)
Noureddine El Karoui (Criteo AI Lab & University of California, Berkeley)

Multi-Agent Graph-Attention Communication and Teaming (Page 964)
Yaru Niu (Georgia Institute of Technology)
Rohan Paleja (Georgia Institute of Technology)
Matthew Gombolay (Georgia Institute of Technology)

Emergent Communication under Competition (Page 974)
Michael Noukhovitch (Université de Montréal)
Travis LaCroix (Université de Montréal & University of Toronto)
Angeliki Lazaridou (Deepmind)
Aaron Courville (Université de Montréal & CIFAR Fellow)

Safe Pareto Improvements for Delegated Game Playing (Page 983)
Caspar Oesterheld (Duke University)
Vincent Conitzer (Duke University)

Active Screening for Recurrent Diseases: A Reinforcement Learning Approach (Page 992)
Han-Ching Ou (Harvard University)
Haipeng Chen (Harvard University)
Shahin Jabbari (Harvard University)
Milind Tambe (Harvard University)

Group Fairness for Knapsack Problems (Page 1001)
Deval Patel (Indian Institute of Science)
Arindam Khan (Indian Institute of Science)
Anand Louis (Indian Institute of Science)

An Agent-Based Model to Predict Pedestrians Trajectories with an Autonomous Vehicle in Shared Spaces (Page 1010)
Manon Prédhumeau (University Grenoble Alpes, LIG)
Lyuba Mancheva (University Grenoble Alpes, LIG)
Julie Dugdale (University Grenoble Alpes, LIG)
Anne Spalanzani (University Grenoble Alpes, Inria)

Latency-Aware Local Search for Distributed Constraint Optimization (Page 1019)
Ben Rachmut (Ben-Gurion University of the Negev)
Roie Zivan (Ben-Gurion University of the Negev)
William Yeoh (Washington University in St. Louis)

Accelerating Recursive Partition-Based Causal Structure Learning (Page 1028)
Md. Musfiqur Rahman (University of Dhaka)
Ayman Rasheed (University of Dhaka)
Md. Mosaddek Khan (University of Dhaka)
Mohammad Ali Javidian (Purdue University)
Pooyan Jamshidi (University of South Carolina)
Md. Mamun-Or-Rashid (University of Dhaka)

Peer-to-peer Autonomous Agent Communication Network (Page 1037)
Lokman Rahmani (
David Minarsch (
Jonathan Ward (

Nash Equilibria in Finite-Horizon Multiagent Concurrent Games (Page 1046)
Senthil Rajasekaran (Rice University)
Moshe Y. Vardi (Rice University)

MAPFAST: A Deep Algorithm Selector for Multi Agent Path Finding using Shortest Path Embeddings (Page 1055)
Jingyao Ren (University of Southern California)
Vikraman Sathiyanarayanan (University of Southern California)
Eric Ewing (University of Southern California)
Baskin Senbaslar (University of Southern California)
Nora Ayanian (University of Southern California)

User and System Stories: An Agile Approach for Managing Requirements in AOSE (Page 1064)
Sebastian Rodriguez (RMIT University)
John Thangarajah (RMIT University)
Michael Winikoff (Victoria University of Wellington)

Accumulating Risk Capital Through Investing in Cooperation (Page 1073)
Charlotte Roman (University of California, Berkeley)
Michael Dennis (University of California, Berkeley)
Andrew Critch (University of California, Berkeley)
Stuart Russell (University of California, Berkeley)

TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? (Page 1082)
Joshua Romoff (McGill University & Mila)
Peter Henderson (Stanford University)
David Kanaa (Polytechnique Montréal & Mila)
Emmanuel Bengio (McGill University & Mila)
Ahmed Touati (Université de Montréal & Mila)
Pierre-Luc Bacon (Université de Montréal & Mila)
Joelle Pineau (McGill University, MILA & Facebook)

Cooperative and Competitive Biases for Multi-Agent Reinforcement Learning (Page 1091)
Heechang Ryu (Korea Advanced Institute of Science and Technology)
Hayong Shin (Korea Advanced Institute of Science and Technology)
Jinkyoo Park (Korea Advanced Institute of Science and Technology)

SEERL: Sample Efficient Ensemble Reinforcement Learning (Page 1100)
Rohan Saphal (Indian Institute of Technology Madras)
Balaraman Ravindran (Indian Institute of Technology Madras)
Dheevatsa Mudigere (Facebook Inc.)
Sasikant Avancha (Intel Corporation)
Bharat Kaul (Intel Corporation)

Efficiently Guiding Imitation Learning Agents with Human Gaze (Page 1109)
Akanksha Saran (University of Texas at Austin)
Ruohan Zhang (University of Texas at Austin)
Elaine S. Short (Tufts University)
Scott Niekum (University of Texas at Austin)

SPOTTER: Extending Symbolic Planning Operators through Targeted Reinforcement Learning (Page 1118)
Vasanth Sarathy (Smart Information Flow Technologies)
Daniel Kasenberg (Tufts University)
Shivam Goel (Tufts University)
Jivko Sinapov (Tufts University)
Matthias Scheutz (Tufts University)

A Local Search Based Approach to Solve Continuous DCOPs (Page 1127)
Amit Sarker (University of Dhaka)
Moumita Choudhury (University of Dhaka)
Md. Mosaddek Khan (University of Dhaka)

CMCF: An Architecture for Realtime Gesture Generation by Clustering Gestures by Motion and Communicative Function (Page 1136)
Carolyn Saund (University of Glasgow)
Andrei Bîrlǎdeanu (University of Glasgow)
Stacy Marsella (Northeastern University)

Timely Information from Prediction Markets (Page 1145)
Grant Schoenebeck (University of Michigan)
Chenkai Yu (Tsinghua University)
Fang-Yi Yu (Harvard University)

Partial Robustness in Team Formation: Bridging the Gap between Robustness and Resilience (Page 1154)
Nicolas Schwind (National Institute of Advanced Industrial Science and Technology)
Emir Demirović (Delft University of Technology)
Katsumi Inoue (National Institute of Informatics & The Graduate University for Advanced Studies)
Jean-Marie Lagniez (CRIL-CNRS, Université d'Artois)

An Autonomous Negotiating Agent Framework with Reinforcement Learning based Strategies and Adaptive Strategy Switching Mechanism (Page 1163)
Ayan Sengupta (NEC Corporation)
Yasser Mohammad (NEC Corporation)
Shinji Nakadai (NEC Corporation)

Sequential Ski Rental Problem (Page 1173)
Anant Shah (Indian Institute of Technology Madras)
Arun Rajkumar (Indian Institute of Technology Madras)

Multiagent Epidemiologic Inference through Realtime Contact Tracing (Page 1182)
Guni Sharon (Texas A&M University)
James Ault (Texas A&M University)
Peter Stone (The University of Texas at Austin & Sony AI)
Varun Kompella (Sony AI)
Roberto Capobianco (Sony AI & Sapienza University of Rome)

Cooperative Policy Learning with Pre-trained Heterogeneous Observation Representations (Page 1191)
Wenlei Shi (Microsoft Research)
Xinran Wei (Beijing University of Posts and Telecommunications)
Jia Zhang (Microsoft Research)
Xiaoyuan Ni (Hong Kong University of Science and Technology)
Arthur Jiang (Microsoft ARD Incubation Team)
Jiang Bian (Microsoft Research)
Tie-Yan Liu (Microsoft Research)

Cyber Attack Intent Recognition and Active Deception using Factored Interactive POMDPs (Page 1200)
Aditya Shinde (University of Georgia)
Prashant Doshi (University of Georgia)
Omid Setayeshfar (University of Georgia)

Sequential Mechanisms for Multi-type Resource Allocation (Page 1209)
Sujoy Sikdar (Binghamton University)
Xiaoxi Guo (Peking University)
Haibin Wang (Peking University)
Lirong Xia (Rensselaer Polytechnic Institute)
Yongzhi Cao (Peking University)

Active Perception within BDI Agents Reasoning Cycle (Page 1218)
Gustavo R. Silva (Universidade Federal de Santa Catarina)
Jomi F. Hübner (Universidade Federal de Santa Catarina)
Leandro B. Becker (Universidade Federal de Santa Catarina)

AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training (Page 1226)
Thiago D. Simão (Delft University of Technology)
Nils Jansen (Radboud University)
Matthijs T. J. Spaan (Delft University of Technology)

Rankings for Bipartite Tournaments via Chain Editing (Page 1236)
Joseph Singleton (Cardiff University)
Richard Booth (Cardiff University)

Towards Transferrable Personalized Student Models in Educational Games (Page 1245)
Samuel Spaulding (Massachusetts Institute of Technology)
Jocelyn Shen (Massachusetts Institute of Technology)
Haewon Park (Massachusetts Institute of Technology)
Cynthia Breazeal (Massachusetts Institute of Technology)

Regular Model Checking Approach to Knowledge Reasoning over Parameterized Systems (Page 1254)
Daniel Stan (Technical University of Kaiserslautern)
Anthony W. Lin (Technical University of Kaiserslautern, MPI SWS)

Achieving Sybil-Proofness in Distributed Work Systems (Page 1263)
Alexander Stannat (TU Delft)
Can Umut Ileri (TU Delft)
Dion Gijswijt (TU Delft)
Johan Pouwelse (TU Delft)

Mean-Payoff Games with ω-Regular Specifications (Page 1272)
Thomas Steeples (University of Oxford)
Julian Gutierrez (Monash University)
Michael Wooldridge (University of Oxford)

Connections between Fairness Criteria and Efficiency for Allocating Indivisible Chores (Page 1281)
Ankang Sun (University of Warwick)
Bo Chen (University of Warwick)
Xuan Vinh Doan (University of Warwick)

Grab the Reins of Crowds: Estimating the Effects of Crowd Movement Guidance Using Causal Inference (Page 1290)
Koh Takeuchi (Kyoto University)
Ryo Nishida (Tohoku University)
Hisashi Kashima (Kyoto University)
Masaki Onishi (AIST)

Adaptive Cascade Submodular Maximization (Page 1299)
Shaojie Tang (University of Texas at Dallas)
Jing Yuan (University of Texas at Dallas)

Learning Complex Policy Distribution with CEM Guided Adversarial Hypernetwork (Page 1308)
Shi Yuan Tang (Nanyang Technological University & Alibaba Group)
Athirai A. Irissappane (University of Washington)
Frans A. Oliehoek (Delft University of Technology)
Jie Zhang (Nanyang Technological University)

Guiding Evolutionary Strategies with Off-Policy Actor-Critic (Page 1317)
Yunhao Tang (Columbia University)

Efficient Exact Computation of Setwise Minimax Regret for Interactive Preference Elicitation (Page 1326)
Federico Toffano (University College Cork)
Paolo Viappiani (CNRS, LIP6 & Sorbonne Université)
Nic Wilson (University College Cork)

Collaborative Multiagent Decision Making for Lane-Free Autonomous Driving (Page 1335)
Dimitrios Troullinos (Technical University of Crete)
Georgios Chalkiadakis (Technical University of Crete)
Ioannis Papamichail (Technical University of Crete)
Markos Papageorgiou (Technical University of Crete)

No More Hand-Tuning Rewards: Masked Constrained Policy Optimization for Safe Reinforcement Learning (Page 1344)
Stef Van Havermaet (Ghent University - imec)
Yara Khaluf (Ghent University - imec)
Pieter Simoens (Ghent University - imec)

Reinforcement Learning for Unified Allocation and Patrolling in Signaling Games with Uncertainty (Page 1353)
Aravind Venugopal (Indian Institute of Technology, Madras)
Elizabeth Bondi (Harvard University)
Harshavardhan Kamarthi (Georgia Institute of Technology)
Keval Dholakia (Indian Institute of Technology, Madras)
Balaraman Ravindran (Indian Institute of Technology, Madras)
Milind Tambe (Harvard University)

Scalable Optimization for Wind Farm Control using Coordination Graphs (Page 1362)
Timothy Verstraeten (Vrije Universiteit Brussel)
Pieter-Jan Daems (Vrije Universiteit Brussel)
Eugenio Bargiacchi (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel & HU University of Applied Sciences Utrecht)
Pieter J.K. Libin (Hasselt University)
Jan Helsen (Vrije Universiteit Brussel)

Fairness and Efficiency in Facility Location Problems with Continuous Demands (Page 1371)
Chenhao Wang (University of Nebraska-Lincoln)
Mengqi Zhang (University of Chinese Academy of Sciences & AMSS, Chinese Academy of Sciences)

Mechanism Design for Public Projects via Neural Networks (Page 1380)
Guanhua Wang (University of Adelaide)
Runqi Guo (University of Adelaide)
Yuko Sakurai (National Institute of Advanced Industrial Science and Technology)
Muhammad Ali Babar (University of Adelaide)
Mingyu Guo (University of Adelaide)

Strategic Evasion of Centrality Measures (Page 1389)
Marcin Waniek (New York University Abu Dhabi & University of Warsaw)
Jan Woźnica (University of Warsaw)
Kai Zhou (Hong Kong Polytechnic University)
Yevgeniy Vorobeychik (Washington University in St. Louis)
Talal Rahwan (New York University Abu Dhabi)
Tomasz P. Michalak (University of Warsaw)

Transferable Environment Poisoning: Training-time Attack on Reinforcement Learning (Page 1398)
Hang Xu (Nanyang Technological University)
Rundong Wang (Nanyang Technological University)
Lev Raizman (University of Waterloo)
Zinovi Rabinovich (Nanyang Technological University)

Drone Formation Control via Belief-Correlated Imitation Learning (Page 1407)
Bo Yang (Xidian University & Chinese Academy of Sciences)
Chaofan Ma (Zhongyuan University of Technology)
Xiaofang Xia (Xidian University)

Intention Progression using Quantitative Summary Information (Page 1416)
Yuan Yao (Zhejiang University of Technology)
Natasha Alechina (Utrecht University)
Brian Logan (Utrecht University)
John Thangarajah (RMIT University)

A Computational Model of Coping for Simulating Human Behavior in High-Stress Situations (Page 1425)
Nutchanon Yongsatianchot (Northeastern University)
Stacy Marsella (Northeastern University)

Evolution of Strategies in Sequential Security Games (Page 1434)
Adam Źychowski (Warsaw University of Technology)
Jacek Mańdziuk (Warsaw University of Technology)

(Return to Top)

Extended Abstracts

How to Amend a Constitution? Model, Axioms, and Supermajority Rules (Page 1443)
Ben Abramowitz (Weizmann Institute of Science)
Ehud Shapiro (Weizmann Institute of Science)
Nimrod Talmon (Ben-Gurion University of the Negev)

Learning Competitive Equilibria in Noisy Combinatorial Markets (Page 1446)
Enrique Areyan Viqueira (Brown University)
Cyrus Cousins (Brown University)
Amy Greenwald (Brown University)

Interpretive Blindness and the Impossibility of Learning from Testimony (Page 1449)
Nicholas Asher (CNRS/IRIT)
Julie Hunter (LINAGORA Labs)

Quantifying Human Perception with Multi-Armed Bandits (Page 1452)
Julien Audiffren (University of Fribourg)

Modelling Cooperation in Network Games with Spatio-Temporal Complexity (Page 1455)
Michiel A. Bakker (DeepMind & Massachusetts Institute of Technology)
Richard Everett (DeepMind)
Laura Weidinger (DeepMind)
Iason Gabriel (DeepMind)
William S. Isaac (DeepMind)
Joel Z. Leibo (DeepMind)
Edward Hughes (DeepMind)

Image Sequence Understanding through Narrative Sensemaking (Page 1458)
Zev Battad (Rensselaer Polytechnic Institute)
Mei Si (Rensselaer Polytechnic Institute)

Maximizing Influence-Based Group Shapley Centrality (Page 1461)
Ruben Becker (Gran Sasso Science Institute)
Gianlorenzo D'Angelo (Gran Sasso Science Institute)
Hugo Gilbert (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE)

How to Guide a Non-Cooperative Learner to Cooperate: Exploiting No-Regret Algorithms in System Design (Page 1464)
Nicholas Bishop (University of Southampton)
Le Cong Dinh (University of Southampton)
Long Tran-Thanh (University of Warwick)

Learning Index Policies for Restless Bandits with Application to Maternal Healthcare (Page 1467)
Arpita Biswas (Google Research)
Gaurav Aggarwal (Google Research)
Pradeep Varakantham (Google Research)
Milind Tambe (Google Research)

CHARET: Character-centered Approach to Emotion Tracking in Stories (Page 1469)
Diogo S. Carvalho (INESC-ID & Universidade de Lisboa)
Joana Campos (INESC-ID)
Manuel Guimarães (INESC-ID & Universidade de Lisboa)
Ana Antunes (INESC-ID & Universidade de Lisboa)
João Dias (INESC-ID, Universidade do Algarve & CCMAR)
Pedro A. Santos (INESC-ID & Universidade de Lisboa)

On the Sensory Commutativity of Action Sequences for Embodied Agents (Page 1472)
Hugo Caselles-Dupré (U2IS, ENSTA Paris, Institut Polytechnique de Paris & INRIA)
Michael Garcia-Ortiz (CitAI, SMCSE, City University of London)
David Filliat (U2IS, ENSTA Paris, Institut Polytechnique de Paris & INRIA)

Difference Rewards Policy Gradients (Page 1475)
Jacopo Castellini (University of Liverpool)
Sam Devlin (Microsoft Research Cambridge)
Frans A. Oliehoek (Delft University of Technology)
Rahul Savani (University of Liverpool)

Learning to Cooperate with Unseen Agents Through Meta-Reinforcement Learning (Page 1478)
Rujikorn Charakorn (Vidyasirimedhi Institute of Science and Technology (VISTEC))
Poramate Manoonpong (Vidyasirimedhi Institute of Science and Technology (VISTEC))
Nat Dilokthanakul (Vidyasirimedhi Institute of Science and Technology (VISTEC))

Promoting Fair Proposers, Fair Responders or Both? Cost-Efficient Interference in the Spatial Ultimatum Game (Page 1480)
Theodor Cimpeanu (Teesside University)
Cedric Perret (University of Exeter)
The Anh Han (Teesside Univeresity)

A Logic of Inferable in Multi-Agent Systems with Budget and Costs (Page 1483)
Stefania Costantini (University of L'Aquila)
Andrea Formisano (University of Udine)
Valentina Pitoni (University of L'Aquila)

Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning (Page 1486)
Brett Daley (Northeastern University)
Cameron Hickert (Harvard University)
Christopher Amato (Northeastern University)

A Generic Multi-Agent Model for Resource Allocation Strategies in Online On-Demand Transport with Autonomous Vehicles (Page 1489)
Alaa Daoud (Mines Saint-Étienne, CNRS, UMR 6158, LIMOS Institut Henri Fayol)
Flavien Balbo (Mines Saint-Étienne, CNRS, UMR 6158, LIMOS Institut Henri Fayol)
Paolo Gianessi (Mines Saint-Étienne, CNRS, UMR 6158, LIMOS Institut Henri Fayol)
Gauthier Picard (ONERA/DTIS & Université de Toulouse)

A Multi-Arm Bandit Approach To Subset Selection Under Constraints (Page 1492)
Ayush Deva (International Institute of Information Technology Hyderabad)
Kumar Abhishek (International Institute of Information Technology Hyderabad)
Sujit Gujar (International Institute of Information Technology Hyderabad)

It's A Match! Gesture Generation Using Expressive Parameter Matching (Page 1495)
Ylva Ferstl (Trinity College Dublin)
Michael Neff (University of California Davis)
Rachel McDonnell (Trinity College Dublin)

Partially Cooperative Multi-Agent Periodic Indivisible Resource Allocation (Page 1498)
Yuval Gabai Schlosberg (Ben Gurion University of the Negev)
Roie Zivan (Ben Gurion University of the Negev)

Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity (Page 1501)
Marta Garnelo (DeepMind)
Wojciech Marian Czarnecki (DeepMind)
Siqi Liu (DeepMind)
Dhruva Tirumala (DeepMind)
Junhyuk Oh (DeepMind)
Gauthier Gidel (DeepMind)
Hado van Hasselt (DeepMind)
David Balduzzi (DeepMind)

Towards a Competence-Based Approach to Allocate Teams to Tasks (Page 1504)
Athina Georgara (Artificial Intelligence Research Institute (IIIA-CSIC))
Juan A. Rodríguez-Aguilar (Artificial Intelligence Research Institute (IIIA-CSIC))
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))

Shielding Atari Games with Bounded Prescience (Page 1507)
Mirco Giacobbe (University of Oxford)
Mohammadhosein Hasanbeig (University of Oxford)
Daniel Kroening (Amazon, Inc.)
Hjalmar Wijk (University of Oxford)

Comparison of Desynchronization Methods for a Decentralized Swarm on a Logistical Resupply Problem (Page 1510)
Joseph P. Giordano (University of Central Florida)
Annie S. Wu (University of Central Florida)
Arjun Pherwani (University of Central Florida)
H. David Mathias (University of Wisconsin-La Crosse)

Towards Decentralized Social Reinforcement Learning via Ego-Network Extrapolation (Page 1512)
Mahak Goindani (Purdue University)
Jennifer Neville (Purdue University)

A Global Multi-Sided Market with Ascending-Price Mechanism (Page 1515)
Rica Gonen (The Open University of Israel)
Erel Segal-Halevi (Ariel University)

Rank Aggregation by Dissatisfaction Minimisation in the Unavailable Candidate Model (Page 1518)
Arnaud Grivet Sébert (Université Paris-Saclay, CEA, List)
Nicolas Maudet (LIP6, UMR 7606, Sorbonne Université)
Patrice Perny (LIP6, UMR 7606, Sorbonne Université)
Paolo Viappiani (LIP6, UMR 7606, CNRS & Sorbonne Université)

Sequential and Swap Mechanisms for Public Housing Allocation with Quotas and Neighbourhood-Based Utilities (Page 1521)
Nathanaël Gross-Humbert (LIP6, Sorbonne Université, CNRS)
Nawal Benabbou (LIP6, Sorbonne Université, CNRS)
Aurélie Beynier (LIP6, Sorbonne Université, CNRS)
Nicolas Maudet (LIP6, Sorbonne Université, CNRS)

Teaching Unknown Learners to Classify via Feature Importance (Page 1524)
Carla Guerra (INESC-ID & Universidade de Lisboa)
Francisco S. Melo (INESC-ID & Universidade de Lisboa)
Manuel Lopes (INESC-ID & Universidade de Lisboa)

Simultaneous Learning of Moving and Active Perceptual Policies for Autonomous Robot (Page 1527)
Wataru Hatanaka (RICOH COMPANY, LTD.)
Fumihiro Sasaki (RICOH COMPANY, LTD.)
Ryota Yamashina (RICOH COMPANY, LTD.)
Atsuo Kawaguchi (RICOH COMPANY, LTD.)

Distributional Monte Carlo Tree Search for Risk-Aware and Multi-Objective Reinforcement Learning (Page 1530)
Conor F. Hayes (National University of Ireland Galway)
Mathieu Reymond (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel (BE) & HU University of Appl. Science Utrecht (NL))
Enda Howley (National University of Ireland Galway)
Patrick Mannion (National University of Ireland Galway)

Approximating Spatial Evolutionary Games using Bayesian Networks (Page 1533)
Vincent Hsiao (University of Maryland)
Xinyue Pan (University of Maryland)
Dana Nau (University of Maryland)
Rina Dechter (University of California, Irvine)

Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments (Page 1536)
Dmitry Ivanov (JetBrains Research & HSE University)
Vladimir Egorov (JetBrains Research & HSE University)
Aleksei Shpilman (JetBrains Research & HSE University)

We might walk together, but I run faster: Network Fairness and Scalability in Blockchains (Page 1539)
Anurag Jain (International Institute of Information Technology, Hyderabad)
Shoeb Siddiqui (International Institute of Information Technology, Hyderabad)
Sujit Gujar (International Institute of Information Technology, Hyderabad)

Preserving Consistency for Liquid Knapsack Voting (Page 1542)
Pallavi Jain (Indian Institute of Technology Jodhupar)
Krzysztof Sornat (Massachusetts Institute of Technology)
Nimrod Talmon (Ben-Gurion University of the Negev)

Strategic Abilities of Asynchronous Agents: Semantic Side Effects (Page 1545)
Wojciech Jamroga (Polish Academy of Sciences & University of Luxembourg)
Wojciech Penczek (Polish Academy of Sciences)
Teofil Sidoruk (Polish Academy of Sciences & Warsaw University of Technology)

Solving 3D Bin Packing Problem via Multimodal Deep Reinforcement Learning (Page 1548)
Yuan Jiang (Nanyang Technological University)
Zhiguang Cao (National University of Singapore)
Jie Zhang (Nanyang Technological University)

Toward Consistent Agreement Approximation in Abstract Argumentation and Beyond (Page 1551)
Timotheus Kampik (Umeå University)
Juan Carlos Nieves (Umeå University)

Coverage Control under Connectivity Constraints (Page 1554)
Shota Kawajiri (Mitsubishi Electric Corporation)
Kazuki Hirashima (Mitsubishi Electric Corporation)
Masashi Shiraishi (Mitsubishi Electric Corporation)

Solver Agent: Towards Emotional and Opponent-Aware Agent for Human-Robot Negotiation (Page 1557)
Mehmet Onur Keskin (Özyeğin University)
Umut Çakan (Özyeğin University)
Reyhan Aydoğan (Özyeğin University)

Evaluating the Robustness of Collaborative Agents (Page 1560)
Paul Knott (University of Nottingham)
Micah Carroll (University of California, Berkeley)
Sam Devlin (Microsoft Research)
Kamil Ciosek (Microsoft Research)
Katja Hofmann (Microsoft Research)
Anca Dragan (University of California, Berkeley)
Rohin Shah (University of California, Berkeley)

On Weakly and Strongly Popular Rankings (Page 1563)
Sonja Kraiczy (University of Oxford)
Ágnes Cseh (University of Potsdam)
David Manlove (University of Glasgow)

Fairness in Long-Term Participatory Budgeting (Page 1566)
Martin Lackner (TU Wien)
Jan Maly (TU Wien)
Simon Rey (University of Amsterdam)

RPPLNS: Pay-per-last-N-shares with a Randomised Twist (Page 1569)
Philip Lazos (Sapienza University of Rome)
Francisco J. Marmolejo Cossío (University of Oxford & IOHK)
Xinyu Zhou (University of Maryland)
Jonathan Katz (University of Maryland)

Learning Cooperative Solution Concepts from Voting Behavior: A Case Study on the Israeli Knesset (Page 1572)
Omer Lev (Ben-Gurion University of the Negev)
Wei Lu (Massachusetts Institute of Technology)
Alan Tsang (Carleton University)
Yair Zick (University of Massachusetts, Amherst)

Partial Disclosure of Private Dependencies in Privacy Preserving Planning (Page 1575)
Rotem Lev Lehman (Ben Gurion University of the Negev)
Guy Shani (Ben Gurion University of the Negev)
Roni Stern (Palo Alto Research Center & Ben Gurion University of the Negev)

Object Allocation Over a Network of Objects: Mobile Agents with Strict Preferences (Page 1578)
Fu Li (University of Texas at Austin)
C. Gregory Plaxton (University of Texas at Austin)
Vaibhav B. Sinha (University of Texas at Austin)

Anytime Multi-Agent Path Finding via Large Neighborhood Search (Page 1581)
Jiaoyang Li (University of Southern California)
Zhe Chen (Monash University)
Daniel Harabor (Monash University)
Peter J. Stuckey (Monash University)
Sven Koenig (University of Southern California)

Reliability-Aware Multi-UAV Coverage Path Planning using a Genetic Algorithm (Page 1584)
Mickey Li (University of Bristol)
Arthur Richards (University of Bristol)
Mahesh Sooriyabandara (Toshiba Research Europe Ltd.)

Call Markets with Adaptive Clearing Intervals (Page 1587)
Buhong Liu (King's College London)
Maria Polukarov (King's College London)
Carmine Ventre (King's College London)
Lingbo Li (Turing Intelligence Technology)
Leslie Kanthan (Turing Intelligence Technology)

Solid Semantics and Extension Aggregation Using Quota Rules under Integrity Constraints (Page 1590)
Xiaolong Liu (Sun Yat-sen University)
Weiwei Chen (Sun Yat-sen University)

Trajectory Diversity for Zero-Shot Coordination (Page 1593)
Andrei Lupu (McGill University & Mila)
Hengyuan Hu (Facebook AI Research)
Jakob Foerster (Facebook AI Research)

Optimized Execution of PDDL Plans using Behavior Trees (Page 1596)
Francisco Martín Rico (Rey Juan Carlos University)
Matteo Morelli (CEA list)
Huascar Espinoza (CEA list)
Francisco J. Rodríguez-Lera (University of León)
Vicente Matellán Olivera (University of León)

A Strategic Analysis of Portfolio Compression (Page 1599)
Katherine Mayo (University of Michigan)
Michael P. Wellman (University of Michigan)

A General Framework for the Logical Representation of Combinatorial Exchange Protocols (Page 1602)
Munyque Mittelmann (Université de Toulouse - IRIT)
Sylvain Bouveret (Université Grenoble Alpes - LIG)
Laurent Perrussel (Université de Toulouse - IRIT)

A Privacy-Preserving and Accountable Multi-agent Learning Framework (Page 1605)
Anudit Nagar (Bennett University)
Cuong Tran (Syracuse University)
Ferdinando Fioretto (Syracuse University)

SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning (Page 1607)
Somjit Nath (TCS Research)
Richa Verma (TCS Research)
Abhik Ray (BTS-Pilani (Goa))
Harshad Khadilkar (TCS Research)

Tunable Behaviours in Sequential Social Dilemmas using Multi-Objective Reinforcement Learning (Page 1610)
David O'Callaghan (National University of Ireland Galway)
Patrick Mannion (National University of Ireland Galway)

Online Learning of Shaping Reward with Subgoal Knowledge (Page 1613)
Takato Okudo (The Graduate University of Advanced Studies & SOKENDAI)
Seiji Yamada (National Institute of Informatics, NII, Graduate University for Advanced Studies & SOKENDAI )

Attention Actor-Critic Algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning (Page 1616)
P. Parnika (MindTree Ltd., India)
Raghuram Bharadwaj Diddigi (Indian Institute of Science, India)
Sai Koti Reddy Danda (IBM Research, India)
Shalabh Bhatnagar (Indian Institute of Science, India)

Toward a Self-Learning Governance Loop for Competitive Multi-Attribute MAS (Page 1619)
Michael Pernpeintner (University of Mannheim)

Personalising the Dialogue of Relational Agents for First-Time Users (Page 1622)
Hedieh Ranjbartabar (Macquarie University)
Deborah Richards (Macquarie University)
Ayse Aysin Bilgin (Macquarie University)
Cat Kutay (Charles Darwin University)

Finite-time Consensus in the Presence of Malicious Agents (Page 1625)
Sachit Rao (International Institute of Information Technology - Bangalore)
Shrisha Rao (International Institute of Information Technology - Bangalore)

Multiagent Task Allocation and Planning with Multi-Objective Requirements (Page 1628)
Thomas Robinson (University of Wollongong)
Guoxin Su (University of Wollongong)
Minjie Zhang (University of Wollongong)

An Autonomous Drive Balancing Strategy for the Design of Purpose in Open-ended Learning Robots (Page 1631)
Alejandro Romero (Universidade da Coruña)
Francisco Bellas (Universidade da Coruña)
Richard J. Duro (Universidade da Coruña)

Combining LSTMs and Symbolic Approaches for Robust Plan Recognition (Page 1634)
Leonardo Rosa Amado (Pontifical Catholic University of Rio Grande do Sul)
Ramon Fraga Pereira (Sapienza University of Rome)
Felipe Meneguzzi (Pontifical Catholic University of Rio Grande do Sul)

Dynamic Skill Selection for Learning Joint Actions (Page 1637)
Enna Sachdeva (Oregon State University)
Shauharda Khadka (MIcrosoft)
Somdeb Majumdar (Intel Labs)
Kagan Tumer (Oregon State University)

Mitigating Negative Side Effects via Environment Shaping (Page 1640)
Sandhya Saisubramanian (University of Massachusetts, Amherst)
Shlomo Zilberstein (University of Massachusetts, Amherst)

Social Network Interventions to Prevent Reciprocity-driven Polarization (Page 1643)
Fernando P. Santos (Princeton University & University of Amsterdam)
Francisco C. Santos (INESC-ID & Universidade de Lisboa)
Jorge M. Pacheco (Universidade do Minho)
Simon A. Levin (Princeton University)

HOAD: The Hanabi Open Agent Dataset (Page 1646)
Aron Sarmasi (University of California, Davis)
Timothy Zhang (University of California, Davis)
Chu-Hung Cheng (University of California, Davis)
Huyen Pham (University of California, Davis)
Xuanchen Zhou (University of California, Davis)
Duong Nguyen (University of California, Davis)
Soumil Shekdar (University of California, Davis)
Joshua McCoy (University of California, Davis)

Egalitarian and Just Digital Currency Networks (Page 1649)
Gal Shahaf (Weizmann Institute of Science)
Ehud Shapiro (Weizmann Institute of Science)
Nimrod Talmon (Ben-Gurion University of the Negev)

MAS-Bench: Parameter Optimization Benchmark for Multi-agent Crowd Simulation (Page 1652)
Shusuke Shigenaka (University of Tsukuba)
Shunki Takami (University of Tsukuba)
Shuhei Watanabe (University of Freiburg)
Yuki Tanigaki (National Institute of Advanced Industrial Science and Technology)
Yoshihiko Ozaki (National Institute of Advanced Industrial Science and Technology & GREE, Inc.)
Masaki Onishi (National Institute of Advanced Industrial Science and Technology)

Approximate Difference Rewards for Scalable Multigent Reinforcement Learning (Page 1655)
Arambam James Singh (Singapore Management University)
Akshat Kumar (Singapore Management University)
Hoong Chuin Lau (Singapore Management University)

Self-Attention Meta-Learner for Continual Learning (Page 1658)
Ghada Sokar (Eindhoven University of Technology)
Decebal Constantin Mocanu (University of Twente & Eindhoven University of Technology)
Mykola Pechenizkiy (Eindhoven University of Technology)

A Succinct Representation Scheme for Cooperative Games under Uncertainty (Page 1661)
Errikos Streviniotis (Technical University of Crete)
Athina Georgara (Artificial Intelligence Research Institute (IIIA-CSIC))
Georgios Chalkiadakis (Technical University of Crete)

Gambler Bandits and the Regret of Being Ruined (Page 1664)
Filipo Studzinski Perotto (University of Toulouse)
Sattar Vakili (MediaTek Research)
Pratik Gajane (University of Leoben)
Yaser Faghan (University of Lisbon)
Mathieu Bourgais (INSA of Rouen)

Intrinsic Motivated Multi-Agent Communication (Page 1668)
Chuxiong Sun (The Institute of Software, Chinese Academy of Sciences)
Bo Wu (The Institute of Software, Chinese Academy of Sciences)
Rui Wang (The Institute of Software, Chinese Academy of Sciences)
Xiaohui Hu (The Institute of Software, Chinese Academy of Sciences)
Xiaoya Yang (The Institute of Software, Chinese Academy of Sciences)
Cong Cong (The Institute of Software, Chinese Academy of Sciences)

A Distributional Perspective on Value Function Factorization Methods for Multi-Agent Reinforcement Learning (Page 1671)
Wei-Fang Sun (National Tsing Hua University)
Cheng-Kuang Lee (NVIDIA Corporation)
Chun-Yi Lee (National Tsing Hua University)

Sound Algorithms in Imperfect Information Games (Page 1674)
Michal Šustr (Czech Technical University & DeepMind)
Martin Schmid (DeepMind)
Matej Moravćík (DeepMind)
Neil Burch (DeepMind)
Marc Lanctot (DeepMind)
Michael Bowling (DeepMind)

Branch-and-Bound Heuristics for Incomplete DCOPs (Page 1677)
Atena M. Tabakhi (Washington University in St. Louis)
Yuanming Xiao (Washington University in St. Louis)
William Yeoh (Washington University in St. Louis)
Roie Zivan (Ben Gurion University of the Negev)

Cohorting to Isolate Asymptomatic Spreaders: An Agent-Based Simulation Study on the Mumbai Suburban Railway (Page 1680)
Alok Talekar (Google Inc.)
Sharad Shriram (Indian Institute of Science, Bengaluru)
Nidhin Vaidhiyan (Indian Institute of Science, Bengaluru)
Gaurav Aggarwal (Google Inc.)
Jiangzhuo Chen (University of Virginia)
Srini Venkatramanan (University of Virginia)
Lijing Wang (University of Virginia)
Aniruddha Adiga (University of Virginia)
Adam Sadilek (Google Inc.)
Ashish Tendulkar (Google Inc.)
Madhav Marathe (University of Virginia)
Rajesh Sundaresan (Indian Institute of Science, Bengaluru & Strand Life Sciences)
Milind Tambe (Google Inc.)

Eliciting Fairness in Multiplayer Bargaining through Network-Based Role Assignment (Page 1683)
Andreia Sofia Teixeira (INESC-ID and Hospital da Luz Learning Health Luz Saúde & Indiana University Network Science Institute)
Francisco C. Santos (Universidade de Lisboa)
Alexandre P. Francisco (Universidade de Lisboa)
Fernando P. Santos (Princeton University & University of Amsterdam)

Learning Robust Helpful Behaviors in Two-Player Cooperative Atari Environments (Page 1686)
Paul Tylkin (Harvard University)
Goran Radanovic (Max Planck Institute for Software Systems)
David C. Parkes (Harvard University)

Towards Sample Efficient Learners in Population based Referential Games through Action Advising (Page 1689)
Shresth Verma (ABV-Indian Institute of Information Technology and Management Gwalior)

Distributed Q-Learning with State Tracking for Multi-agent Networked Control (Page 1692)
Hang Wang (Arizona State University)
Sen Lin (Arizona State University)
Hamid Jafarkhani (University of California, Irvine)
Junshan Zhang (Arizona State University)

The Tight Bound for Pure Price of Anarchy in an Extended Miner's Dilemma Game (Page 1695)
Qian Wang (Peking University)
Yurong Chen (Peking University)

The Sabre Narrative Planner: Multi-Agent Coordination with Intentions and Beliefs (Page 1698)
Stephen G. Ware (University of Kentucky)
Cory Siler (University of Kentucky)

Learning Policies for Effective Incentive Allocation in Unknown Social Networks (Page 1701)
Shiqing Wu (University of Tasmania)
Quan Bai (University of Tasmania)
Weihua Li (Auckland University of Technology)

Optimal Crowdfunding Design (Page 1704)
Xiang Yan (Shanghai Jiao Tong University)
Yiling Chen (Harvard University)

A Blockchain-Enabled Quantitative Approach to Trust and Reputation Management with Sparse Evidence (Page 1707)
Leonit Zeynalvand (Nanyang Technological University)
Tie Luo (Missouri University of Science and Technology)
Ewa Andrejczuk (Nanyang Technological University)
Dusit Niyato (Nanyang Technological University)
Sin G. Teo (Institute for Infocomm Research)
Jie Zhang (Nanyang Technological University)

Fast Adaptation to External Agents via Meta Imitation Counterfactual Regret Advantage (Page 1709)
Mingyue Zhang (Peking University)
Zhi Jin (Peking University)
Yang Xu (University of Electronic Science and Technology of China)
Zehan Shen (Nanjing University)
Kun Liu (Peking University)
Keyu Pan (University of Electronic Science and Technology of China)

Deep Interactive Bayesian Reinforcement Learning via Meta-Learning (Page 1712)
Luisa Zintgraf (University of Oxford & Microsoft Research)
Sam Devlin (Microsoft Research)
Kamil Ciosek (Microsoft Research)
Shimon Whiteson (University of Oxford)
Katja Hofmann (Microsoft Research)

(Return to Top)


A Norm Enforcement Mechanism for a Time-Constrained Conditional Normative Framework (Page 1715)
Babatunde Opeoluwa Akinkunmi (University of Ibadan)
Florence Moyin Babalola (The Polytechnic, Ibadan)

Agent Programming in the Cognitive Era (Page 1718)
Rafael H. Bordini (Pontifical Catholic University of Rio Grande do Sul)
Amal El Fallah Seghrouchni (Sorbonne Université)
Koen Hindriks (Vrije Universiteit Amsterdam)
Brian Logan (University of Nottingham)
Alessandro Ricci (University of Bologna)

Logic-based Technologies for Multi-agent Systems: Summary of a Systematic Literature Review (Page 1721)
Roberta Calegari (Università di Bologna)
Giovanni Ciatto (Università di Bologna)
Viviana Mascardi (Università di Genova)
Andrea Omicini (Università di Bologna)

Programming Agent-based Mobile Apps: The JaCa-Android Framework (Page 1724)
Angelo Croatti (Alma Mater Studiorum - Università di Bologna)
Alessandro Ricci (Alma Mater Studiorum - Università di Bologna)

Logic-based Specification and Verification of Homogeneous Dynamic Multi-agent Systems (Page 1727)
Riccardo De Masellis (Uppsala University)
Valentin Goranko (Stockholm University)

On Teammate-Pattern-Aware Autonomy (Page 1730)
Edmund H. Durfee (University of Michigan)
Abhishek Thakur (BRINC Drones)
Eli Goldweber (University of Michigan)

Summarising a Framework for the Certification of Reliable Autonomous Systems (Page 1733)
Michael Fisher (University of Manchester)
Viviana Mascardi (University of Genova)
Kristin Y. Rozier (Iowa State University)
Bernd-Holger Schlingloff (Humboldt University & FOKUS)
Michael Winikoff (Victoria University of Wellington)
Neil Yorke-Smith (Delft University of Technology)

Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework (Page 1735)
Guangliang Li (Ocean University of China)
Hamdi Dibeklioğlu (Bilkent University)
Shimon Whiteson (University of Oxford)
Hayley Hung (Delft University of Technology)

Teaching a Robot with Unlabeled Instructions: The TICS Architecture (Page 1738)
Anis Najar (Laboratoire de Neurosciences Cognitives Computationnelles)
Olivier Sigaud (Sorbonne Université)
Mohamed Chetouani (Sorbonne Université)

STRATA: Unified Framework for Task Assignments in Large Teams of Heterogeneous Agents (Page 1740)
Harish Ravichandar (Georgia Institute of Technology)
Kenneth Shaw (Carnegie Mellon University)
Sonia Chernova (Georgia Institute of Technology)

A Decentralised Self-Healing Approach for Network Topology Maintenance (Page 1743)
Arles Rodríguez (Fundación Universitaria Konrad Lorenz)
Jonatan Gómez (Universidad Nacional de Colombia)
Ada Diaconescu (Telecom Paris, LTCI & Institut Polytechnique de Paris)

Constructing Junction Tree Agent Organization with Privacy (Page 1746)
Yang Xiang (University of Guelph)
Abdulrahman Alshememry (King Saud University)

(Return to Top)

Demonstration Track

ADT2AMAS: Managing Agents in Attack-Defence Scenarios (Page 1749)
Jaime Arias (LIPN, CNRS UMR 7030, Université Sorbonne Paris Nord)
Wojciech Penczek (Institute of Computer Science, Polish Academy of Sciences)
Laure Petrucci (LIPN, CNRS UMR 7030, Université Sorbonne Paris Nord)
Teofil Sidoruk (Institute of Computer Science, Polish Academy of Sciences & Warsaw University of Technology)

Distributing Responsibilities for Exception Handling in JaCaMo (Page 1752)
Matteo Baldoni (Università di Torino)
Cristina Baroglio (Università di Torino)
Olivier Boissier (Institut Henri Fayol)
Roberto Micalizio (Università di Torino)
Stefano Tedeschi (Università di Torino)

Ship-GAN: Generative Modeling Based Maritime Traffic Simulator (Page 1755)
Chaithanya Basrur (Singapore Management University)
Arambam James Singh (Singapore Management University)
Arunesh Sinha (Singapore Management University)
Akshat Kumar (Singapore Management University)

An Online Human-Agent Interaction System: A Brain-controlled Agent Playing Games in Unity (Page 1758)
Zehong Cao (University of Tasmania)
Jie Yun (University of Tasmania)

Argflow: A Toolkit for Deep Argumentative Explanations for Neural Networks (Page 1761)
Adam Dejl (Imperial College London)
Peter He (Imperial College London)
Pranav Mangal (Imperial College London)
Hasan Mohsin (Imperial College London)
Bogdan Surdu (Imperial College London)
Eduard Voinea (Imperial College London)
Emanuele Albini (Imperial College London)
Piyawat Lertvittayakumjorn (Imperial College London)
Antonio Rago (Imperial College London)
Francesca Toni (Imperial College London)

Strategy RV: A Tool to Approximate ATL Model Checking under Imperfect Information and Perfect Recall (Page 1764)
Angelo Ferrando (The University of Manchester)
Vadim Malvone (Télécom Paris)

Autonomous Agents on the Edge of Things (Page 1767)
Timotheus Kampik (Umeå University)
Andres Gomez (University of St. Gallen)
Andrei Ciortea (University of St. Gallen)
Simon Mayer (University of St. Gallen)

STV+Reductions: Towards Practical Verification of Strategic Ability Using Model Reductions (Page 1770)
Damian Kurpiewski (Polish Academy of Sciences)
Witold Pazderski (Polish Academy of Sciences)
Wojciech Jamroga (Polish Academy of Sciences & University of Luxembourg)
Yan Kim (University of Luxembourg)

A Collaborative Platform for Identifying Context-Specific Values (Page 1773)
Enrico Liscio (Delft University of Technology)
Michiel van der Meer (Leiden University)
Catholijn M. Jonker (Delft University of Technology)
Pradeep K. Murukannaiah (Delft University of Technology)

Scalable Multiple Robot Task Planning with Plan Merging and Conflict Resolution (Page 1776)
Gilberto Marcon dos Santos (Oregon State University)
Julie A. Adams (Oregon State University)

A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents (Page 1779)
Rajmund Nagy (KTH Royal Institute of Technology)
Taras Kucherenko (KTH Royal Institute of Technology)
Birger Moell (KTH Royal Institute of Technology)
André Pereira (KTH Royal Institute of Technology)
Hedvig Kjellström (KTH Royal Institute of Technology)
Ulysses Bernardet (Aston University)

Symbolic Reinforcement Learning for Safe RAN Control (Page 1782)
Alexandros Nikou (Ericsson Research)
Anusha Mujumdar (Ericsson Research)
Marin Orlić (Ericsson Research)
Aneta Vulgarakis Feljan (Ericsson Research)

TORS: A Train Unit Shunting and Servicing Simulator (Page 1785)
Jacobus G.M. van der Linden (Delft University of Technology)
Jesse Mulderij (Delft University of Technology)
Bob Huisman (Nederlandse Spoorwegen)
Joris W. den Ouden (Nederlandse Spoorwegen)
Marjan van den Akker (Utrecht University)
Han Hoogeveen (Utrecht University)
Mathijs M. de Weerdt (Delft University of Technology)

ATPT: Automate Typhoon Contingency Plan Generation from Text (Page 1788)
Yifeng Zeng (Northumbria University)
Zhangrui Yao (Xiamen University)
Yinghui Pan (Shenzhen University)
Wanqing Chen (Shenzhen University)
Junxin Zhou (Shenzhen University)
Junhan Chen (Xiamen University)
Biyang Ma (Northumbria University)
Zhong Ming (Shenzhen University)

Graph-based Self-Adaptive Conversational Agent (Page 1791)
Lan Zhang (Auckland University of Technology)
Weihua Li (Auckland University of Technology)
Quan Bai (University of Tasmania)
Edmund Lai (Auckland University of Technology)

(Return to Top)

Doctoral Consortium

Exploring the Relationship Between Social Choice and Machine Learning (Page 1794)
Ben Armstrong (University of Waterloo)

Understanding the Role of Inequality in Creating and Sustaining the Alcohol Harm Paradox using Agent-Based Modelling (Page 1797)
Jennifer Boyd (University of Sheffield)

Computing Desirable Outcomes in Specific Multi-Agent Scenarios (Page 1799)
Martin Bullinger (Technische Universität München)

Multi-Agent Ranked Delegations in Voting (Page 1802)
Rachael Colley (University of Toulouse)

Learning Realistic and Safe Pedestrian Behavior by Imitation (Page 1805)
José Aleixo Cruz (University of Porto)

Reward-Sharing Relational Networks in Multi-Agent Reinforcement Learning as a Framework for Emergent Behavior (Page 1808)
Hossein Haeri (University of Massachusetts, Lowell)

Impact of Recommender Systems on the Dynamics of Users' Choices (Page 1811)
Naieme Hazrati (Free University of Bolzano)

Software Engineering Methods for Responsible Artificial Intelligence (Page 1814)
Zahoor Ul Islam (Umeå University)

Leveraging Social Interactions in Human-Agent Decision-Making (Page 1816)
JiHyun Jeong (Cornell University)

Balancing Fairness, Efficiency and Strategy-Proofness in Voting and Facility Location Problems (Page 1818)
Alexander Lam (University of New South Wales Sydney)

Intention-Aware Human-Robot Collaborative Design (Page 1820)
Matthew V. Law (Cornell University)

Non-manipulability in Set-valued and Probabilistic Social Choice Theory (Page 1823)
Patrick Lederer (Technische Universität München)

Modelling Trust in Human-AI Interaction (Page 1826)
Siddharth Mehrotra (Delft University of Technology)

Simulating Realistic Pedestrian Behaviors in the Context of Autonomous Vehicles in Shared Spaces (Page 1829)
Manon Prédhumeau (Grenoble Alps University, LIG)

Exploiting Hidden Convexities for Real-time and Reliable Optimization Algorithms for Challenging Motion Planning and Control Applications (Page 1832)
Fatemeh Rastgar (University of Tartu)

Adaptable and Verifiable BDI Reasoning (Page 1835)
Peter Stringer (The University of Manchester)

Improving Sample-based Reinforcement Learning through Complex Non-parametric Distributions (Page 1837)
Shi Yuan Tang (Nanyang Technological University)

A Concurrent Language for Negotiation and Debate with Argumentation (Page 1840)
Carlo Taticchi (Gran Sasso Science Institute)

Computing using Samples: Theoretical Guarantees with the Direct Learning Approach (Page 1842)
Vignesh Viswanathan (University of Massachusetts, Amherst)

Inducing Rules about Distributed Robotic Systems for Fault Detection & Diagnosis (Page 1845)
Youssef Mahmoud Youssef (Hochschule Bonn-Rhein-Sieg)

Design and Analysis of Networks under Strategic Behavior (Page 1848)
Sixie Yu (Washington University in St. Louis)

Mechanism Design in Facility Location Games (Page 1850)
Mengqi Zhang (University of Chinese Academy of Sciences & AMSS, Chinese Academy of Sciences)