Main Page

Table of Contents

Author Index

Table of Contents

AAMAS'22 Chairs Welcome
Piotr Faliszewski
Viviana Mascardi
Catherine Pelachaud
Matthew E. Taylor

Conference Organization

Area Chairs

Senior Programme Committee

Programme Committee

Auxiliary Reviewers

Special Track Reviewers 

Awards

Sponsors & Supporters

 

Main Track

Blue Sky Ideas Track

Demonstration Track

Extended Abstracts

Doctoral Consortium

JAAMAS Track

Main Track

Using Agent-Based Simulator to Assess Interventions Against COVID-19 in a Small Community Generated from Map Data (Page 1)
Mitsuteru Abe (University of Tsukuba)
Fabio Tanaka (University of Tsukuba)
Jair Pereira Junior (University of Tsukuba)
Anna Bogdanova (University of Tsukuba)
Tetsuya Sakurai (University of Tsukuba)
Claus Aranha (University of Tsukuba)

Multi-Objective Reinforcement Learning with Non-Linear Scalarization (Page 9)
Mridul Agarwal (Purdue University)
Vaneet Aggarwal (Purdue University)
Tian Lan (George Washington University)

Be Considerate: Avoiding Negative Side Effects in Reinforcement Learning (Page 18)
Parand Alizadeh Alamdari (University of Toronto & Vector Institute)
Toryn Q. Klassen (University of Toronto & Vector Institute)
Rodrigo Toro Icarte (Pontificia Universidad Católica de Chile & Vector Institute)
Sheila A. McIlraith (University of Toronto & Vector Institute)

Hacking the Colony: On the Disruptive Effect of Misleading Pheromone and How to Defend against It (Page 27)
Ashay Aswale (Worcester Polytechnic Institute)
Antonio López (Worcester Polytechnic Institute)
Aukkawut Ammartayakun (Worcester Polytechnic Institute)
Carlo Pinciroli (Worcester Polytechnic Institute)

(Return to Top)

State Supervised Steering Function for Sampling-based Kinodynamic Planning (Page 35)
Pranav Atreya (University of Texas at Austin)
Joydeep Biswas (University of Texas at Austin)

Unbiased Asymmetric Reinforcement Learning under Partial Observability (Page 44)
Andrea Baisero (Northeastern University)
Christopher Amato (Northeastern University)

Multi-Agent Heterogeneous Digital Twin Framework with Dynamic Responsibility Allocation for Complex Task Simulation (Page 53)
Adrian Simon Bauer (German Aerospace Center (DLR) & Robotics and Mechatronics Center (RMC))
Anne Köpken (German Aerospace Center (DLR) & Robotics and Mechatronics Center (RMC))
Daniel Leidner (German Aerospace Center (DLR) & Robotics and Mechatronics Center (RMC))

Reasoning about Human-Friendly Strategies in Repeated Keyword Auctions (Page 62)
Francesco Belardinelli (Université d'Evry)
Wojtek Jamroga (University of Luxembourg & Institute of Computer Science, Polish Academy of Sciences)
Vadim Malvone (Télécom Paris)
Munyque Mittelmann (Université de Toulouse - IRIT)
Aniello Murano (University of Naples Federico II)
Laurent Perrussel (Université de Toulouse - IRIT)

COPALZ: A Computational Model of Pathological Appraisal Biases for an Interactive Virtual Alzheimer Patient (Page 72)
Amine Benamara (CNRS-LISN, Université Paris-Saclay)
Jean-Claude Martin (CNRS-LISN, Université Paris-Saclay)
Elise Prigent (CNRS-LISN, Université Paris-Saclay)
Laurence Chaby (Sorbonne Université)
Mohamed Chetouani (Sorbonne Université)
Jean Zagdoun (Sorbonne Université)
Hélène Vanderstichel (CIREL - EA 4354, Université de Lille)
Sébastien Dacunha (Hôpitaux de Paris)
Brian Ravenet (CNRS-LISN, Université Paris-Saclay)

(Return to Top)

Computing Balanced Solutions for Large International Kidney Exchange Schemes (Page 82)
Márton Benedek (KRTK, Institute of Economics)
Péter Biró (KRTK, Institute of Economics)
Walter Kern (University of Twente)
Daniël Paulusma (Durham University)

Agent-based Modeling and Simulation for Malware Spreading in D2D Networks (Page 91)
Ziyad Benomar (Orange Labs)
Chaima Ghribi (Orange Labs)
Elie Cali (Orange Labs)
Alexander Hinsen (Weierstrass Institute for Applied Analysis and Stochastics)
Benedikt Jahnel (Weierstrass Institute for Applied Analysis and Stochastics)

Quantitative Group Trust: A Two-Stage Verification Approach (Page 100)
Jamal Bentahar (Concordia University)
Nagat Drawel (Concordia University)
Abdeladim Sadiki (Concordia University)

Asynchronous Opinion Dynamics in Social Networks (Page 109)
Petra Berenbrink (Universität Hamburg)
Martin Hoefer (Goethe University Frankfurt)
Dominik Kaaser (Universität Hamburg)
Pascal Lenzner (Hasso Plattner Institute)
Malin Rau (Universität Hamburg)
Daniel Schmand (Universität Bremen)

Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions (Page 118)
Tom Bewley (University of Bristol)
Freddy Lecue (CortAIx, Thales)

Multivariate Algorithmics for Eliminating Envy by Donating Goods (Page 127)
Niclas Boehmer (TU Berlin)
Robert Bredereck (Humboldt-Universität zu Berlin)
Klaus Heeger (TU Berlin)
Dušan Knop (Czech Technical University in Prague)
Junjie Luo (Nanyang Technological University)

Proportional Representation in Matching Markets: Selecting Multiple Matchings under Dichotomous Preferences (Page 136)
Niclas Boehmer (TU Berlin)
Markus Brill (TU Berlin)
Ulrike Schmidt-Kraepelin (TU Berlin)

(Return to Top)

A Hierarchical Bayesian Process for Inverse RL in Partially-Controlled Environments (Page 145)
Kenneth Bogert (University of North Carolina at Asheville)
Prashant Doshi (University of Georgia)

Little House (Seat) on the Prairie: Compactness, Gerrymandering, and Population Distribution (Page 154)
Allan Borodin (University of Toronto)
Omer Lev (Ben-Gurion University of the Negev)
Nisarg Shah (University of Toronto)
Tyrone Strangway (Ben-Gurion University of the Negev)

Knowledge Transmission and Improvement Across Generations do not Need Strong Selection (Page 163)
Yasser Bourahla (University Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG)
Manuel Atencia (Universidad de Málaga)
Jérôme Euzenat (University Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG)

Explainability in Multi-Agent Path/Motion Planning: User-study-driven Taxonomy and Requirements (Page 172)
Martim Brandao (King's College London)
Masoumeh Mansouri (University of Birmingham)
Areeb Mohammed (King's College London)
Paul Luff (King's College London)
Amanda Coles (King's College London)

Relaxed Notions of Condorcet-Consistency and Efficiency for Strategyproof Social Decision Schemes (Page 181)
Felix Brandt (Technical University of Munich)
Patrick Lederer (Technical University of Munich)
René Romen (Technical University of Munich)

Fair Stable Matching Meets Correlated Preferences (Page 190)
Angelina Brilliantova (Rochester Institute of Technology)
Hadi Hosseini (The Pennsylvania State University)

Exploiting Causal Structure for Transportability in Online, Multi-Agent Environments (Page 199)
Axel Browne (Loyola Marymount University)
Andrew Forney (Loyola Marymount University)

(Return to Top)

Beyond Cake Cutting: Allocating Homogeneous Divisible Goods (Page 208)
Ioannis Caragiannis (Aarhus University)
Vasilis Gkatzelis (Drexel University)
Alexandros Psomas (Perdue University)
Daniel Schoepflin (Drexel University)

Planning, Execution, and Adaptation for Multi-Robot Systems using Probabilistic and Temporal Planning (Page 217)
Yaniel Carreno (Heriot-Watt University & The University of Edinburgh)
Jun Hao Alvin Ng (Heriot-Watt University & The University of Edinburgh)
Yvan Petillot (Heriot-Watt University & The University of Edinburgh)
Ron Petrick (Heriot-Watt University & The University of Edinburgh)

Bayesian Persuasion Meets Mechanism Design: Going Beyond Intractability with Type Reporting (Page 226)
Matteo Castiglioni (Politecnico di Milano)
Alberto Marchesi (Politecnico di Milano)
Nicola Gatti (Politecnico di Milano)

Best-Response Bayesian Reinforcement Learning with Bayes-adaptive POMDPs for Centaurs (Page 235)
Mustafa Mert Çelikok (Aalto University)
Frans A. Oliehoek (Delft University of Technology)
Samuel Kaski (Aalto University & University of Manchester)

Anomaly Guided Policy Learning from Imperfect Demonstrations (Page 244)
Zi-Xuan Chen (Nanjing University)
Xin-Qiang Cai (Nanjing University)
Yuan Jiang (Nanjing University)
Zhi-Hua Zhou (Nanjing University)

Individual-Level Inverse Reinforcement Learning for Mean Field Games (Page 253)
Yang Chen (The University of Auckland)
Libo Zhang (The University of Auckland)
Jiamou Liu (The University of Auckland)
Shuyue Hu (National University of Singapore)

Simulating Multiwinner Voting Rules in Judgment Aggregation (Page 263)
Julian Chingoma (University of Amsterdam)
Ulle Endriss (University of Amsterdam)
Ronald de Haan (University of Amsterdam)

Coordinated Multi-Agent Pathfinding for Drones and Trucks over Road Networks (Page 272)
Shushman Choudhury (Stanford University)
Kiril Solovey (Stanford University)
Mykel Kochenderfer (Stanford University)
Marco Pavone (Stanford University)

(Return to Top)

Pippi: Practical Protocol Instantiation (Page 281)
Samuel H. Christie (North Carolina State University)
Amit K. Chopra (Lancaster University)
Munindar P. Singh (North Carolina State University)

Optimizing Multi-Agent Coordination via Hierarchical Graph Probabilistic Recursive Reasoning (Page 290)
Saar Cohen (Bar-Ilan University)
Noa Agmon (Bar-Ilan University)

Pareto Optimal and Popular House Allocation with Lower and Upper Quotas (Page 300)
Ágnes Cseh (Institute of Economics, Centre for Economic and Regional Studies)
Tobias Friedrich (Hasso Plattner Institute, University of Potsdam)
Jannik Peters (TU Berlin)

Three-Dimensional Popular Matching with Cyclic Preferences (Page 309)
Ágnes Cseh (Institute of Economics, Centre for Economic and Regional Studies)
Jannik Peters (TU Berlin)

Poincaré-Bendixson Limit Sets in Multi-Agent Learning (Page 318)
Aleksander Czechowski (Delft University of Technology)
Georgios Piliouras (Singapore University of Technology and Design)

A Distributed Differentially Private Algorithm for Resource Allocation in Unboundedly Large Settings (Page 327)
Panayiotis Danassis (École Polytechnique Fédérale de Lausanne (EPFL))
Aleksei Triastcyn (École Polytechnique Fédérale de Lausanne (EPFL))
Boi Faltings (École Polytechnique Fédérale de Lausanne (EPFL))

Computation and Bribery of Voting Power in Delegative Simple Games (Page 336)
Gianlorenzo D'Angelo (Gran Sasso Science Institute)
Esmaeil Delfaraz (Gran Sasso Science Institute)
Hugo Gilbert (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE)

(Return to Top)

Budgeted Combinatorial Multi-Armed Bandits (Page 345)
Debojit Das (International Institute of Information Technology, Hyderabad)
Shweta Jain (Indian Institute of Technology, Ropar)
Sujit Gujar (International Institute of Information Technology, Hyderabad)

Efficient Approximation Algorithms for the Inverse Semivalue Problem (Page 354)
Ilias Diakonikolas (University of Wisconsin-Madison)
Chrystalla Pavlou (TurinTech AI)
John Peebles (Princeton University)
Alistair Stewart (Web 3 Foundation)

Multiagent Dynamics of Gradual Argumentation Semantics (Page 363)
Louise Dupuis de Tarlé (Université Paris-Dauphine)
Elise Bonzon (Université de Paris)
Nicolas Maudet (Sorbonne Université, CNRS)

How to Fairly Allocate Easy and Difficult Chores (Page 372)
Soroush Ebadian (University of Toronto)
Dominik Peters (University of Toronto)
Nisarg Shah (University of Toronto)

Scalable Multi-Agent Model-Based Reinforcement Learning (Page 381)
Vladimir Egorov (JetBrains Research & HSE University)
Alexei Shpilman (JetBrains Research & HSE University)

Facility Location With Approval Preferences: Strategyproofness and Fairness (Page 391)
Edith Elkind (University of Oxford)
Minming Li (City University of Hong Kong)
Houyu Zhou (City University of Hong Kong)

Betweenness Centrality in Multi-Agent Path Finding (Page 400)
Eric Ewing (University of Southern California)
Jingyao Ren (University of Southern California)
Dhvani Kansara (University of Southern California)
Vikraman Sathiyanarayanan (University of Southern California)
Nora Ayanian (University of Southern California)

(Return to Top)

Welfare vs. Representation in Participatory Budgeting (Page 409)
Roy Fairstein (Ben Gurion University of the Negev)
Dan Vilenchik (Ben Gurion University of the Negev)
Reshef Meir (Technion-Israel Institute of Technology)
Kobi Gal (Ben Gurion University of the Negev & University of Edinburgh)

A Path-following Polynomial Equations Systems Approach for Computing Nash Equilibria (Page 418)
Hélène Fargier (Université de Toulouse, IRIT)
Paul Jourdan (Université de Toulouse, INRAE-MIAT)
Régis Sabbadin (Université de Toulouse, INRAE-MIAT)

Ensemble and Incremental Learning for Norm Violation Detection (Page 427)
Thiago Freitas dos Santos (Artificial Intelligence Research Institute (IIIA-CSIC) & Universitat Autònoma de Barcelona)
Nardine Osman (Artificial Intelligence Research Institute (IIIA-CSIC))
Marco Schorlemmer (Artificial Intelligence Research Institute (IIIA-CSIC))

The Price of Majority Support (Page 436)
Robin Fritsch (ETH Zürich)
Roger Wattenhofer (ETH Zürich)

A Symbolic Representation for Probabilistic Dynamic Epistemic Logic (Page 445)
Sébastien Gamblin (Normandie University, UNICAEN, ENSICAEN, CNRS, GREYC)
Alexandre Niveau (Normandie University, UNICAEN, ENSICAEN, CNRS, GREYC)
Maroua Bouzid (Normandie University, UNICAEN, ENSICAEN, CNRS, GREYC)

Fully-Autonomous, Vision-based Traffic Signal Control: From Simulation to Reality (Page 454)
Deepeka Garg (Aston University)
Maria Chli (Aston University)
George Vogiatzis (Aston University)

One-Sided Matching Markets with Endowments: Equilibria and Algorithms (Page 463)
Jugal Garg (University of Illinois at Urbana-Champaign)
Thorben Tröbst (University of California, Irvine)
Vijay V. Vazirani (University of California, Irvine)

(Return to Top)

Negotiated Path Planning for Non-Cooperative Multi-Robot Systems (Page 472)
Anna Gautier (University of Oxford)
Alex Stephens (University of Oxford)
Bruno Lacerda (University of Oxford)
Nick Hawes (University of Oxford)
Michael Wooldridge (University of Oxford)

Refined Hardness of Distance-Optimal Multi-Agent Path Finding (Page 481)
Tzvika Geft (Tel Aviv University)
Dan Halperin (Tel Aviv University)

Concave Utility Reinforcement Learning: The Mean-field Game Viewpoint (Page 489)
Matthieu Geist (Google)
Julien Pérolat (Deepmind)
Mathieu Laurière (Google)
Romuald Elie (Deepmind)
Sarah Perrin (Univ. Lille, CNRS, Inria, Centrale Lille)
Oliver Bachem (Google)
Rémi Munos (Deepmind)
Olivier Pietquin (Google)

D3C: Reducing the Price of Anarchy in Multi-Agent Learning (Page 498)
Ian Gemp (DeepMind)
Kevin R. McKee (DeepMind)
Richard Everett (DeepMind)
Edgar Duéñez-Guzmán (DeepMind)
Yoram Bachrach (DeepMind)
David Balduzzi (XTX Markets)
Andrea Tacchetti (DeepMind)

Sample-based Approximation of Nash in Large Many-Player Games via Gradient Descent (Page 507)
Ian Gemp (DeepMind)
Rahul Savani (University of Liverpool)
Marc Lanctot (DeepMind)
Yoram Bachrach (DeepMind)
Thomas Anthony (DeepMind)
Richard Everett (DeepMind)
Andrea Tacchetti (DeepMind)
Tom Eccles (DeepMind)
János Kramár (DeepMind)

Building Contrastive Explanations for Multi-Agent Team Formation (Page 516)
Athina Georgara (Artificial Intelligence Research Institute (IIIA-CSIC) & Enzyme Advising Group)
Juan A. Rodriguez Aguilar (Artificial Intelligence Research Institute (IIIA-CSIC))
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))

Long-Term Resource Allocation Fairness in Average Markov Decision Process (AMDP) Environment (Page 525)
Ganesh Ghalme (Technion Israel Institute of Technology)
Vineet Nair (Technion Israel Institute of Technology)
Vishakha Patil (Indian Institute of Science)
Yilun Zhou (Massachusetts Institute of Technology)

(Return to Top)

Fair and Truthful Mechanism with Limited Subsidy (Page 534)
Hiromichi Goko (Toyota Motor Corporation)
Ayumi Igarashi (National Institute of Informatics)
Yasushi Kawase (University of Tokyo)
Kazuhisa Makino (Kyoto University)
Hanna Sumita (Tokyo Institute of Technology)
Akihisa Tamura (Keio University)
Yu Yokoi (National Institute of Informatics)
Makoto Yokoo (Kyushu University)

Robust No-Regret Learning in Min-Max Stackelberg Games (Page 543)
Denizalp Goktas (Brown University)
Jiayi Zhao (Pomona College)
Amy Greenwald (Brown University)

Multi-Agent Curricula and Emergent Implicit Signaling (Page 553)
Niko A. Grupen (Cornell University)
Daniel D. Lee (Cornell Tech)
Bart Selman (Cornell University)

Intention-Aware Navigation in Crowds with Extended-Space POMDP Planning (Page 562)
Himanshu Gupta (University of Colorado Boulder)
Bradley Hayes (University of Colorado Boulder)
Zachary Sunberg (University of Colorado Boulder)

Multiagent Model-based Credit Assignment for Continuous Control (Page 571)
Dongge Han (University of Oxford)
Chris Xiaoxuan Lu (University of Edinburgh)
Tomasz Michalak (University at Warsaw & IDEAS NCBR)
Michael Wooldridge (University of Oxford)

Hierarchical Value Decomposition for Effective On-demand Ride-Pooling (Page 580)
Jiang Hao (Singapore Management University)
Pradeep Varakantham (Singapore Management University)

Computing Nash Equilibria for District-based Nominations (Page 588)
Paul Harrenstein (University of Oxford)
Paolo Turrini (University of Warwick)

Ordinal Maximin Share Approximation for Chores (Page 597)
Hadi Hosseini (The Pennsylvania State University)
Andrew Searns (Johns Hopkins University)
Erel Segal-Halevi (Ariel University)

(Return to Top)

A Mean Field Game Model of Spatial Evolutionary Games (Page 606)
Vincent Hsiao (University of Maryland)
Dana Nau (University of Maryland)

The Dynamics of Q-learning in Population Games: A Physics-inspired Continuity Equation Model (Page 615)
Shuyue Hu (National University of Singapore)
Chin-Wing Leung (The Chinese University of Hong Kong)
Ho-fung Leung (The Chinese University of Hong Kong)
Harold Soh (National University of Singapore)

Reduction-based Solving of Multi-agent Pathfinding on Large Maps Using Graph Pruning (Page 624)
Matej Husár (Charles University)
Jiří Švancara (Charles University)
Philipp Obermeier (University of Potsdam)
Roman Barták (Charles University)
Torsten Schaub (Potassco Solutions & University of Potsdam)

Autonomous Swarm Shepherding Using Curriculum-Based Reinforcement Learning (Page 633)
Aya Hussein (University of New South Wales)
Eleni Petraki (University of Canberra)
Sondoss Elsawah (University of New South Wales)
Hussein A. Abbass (University of New South Wales)

Cascades and Overexposure in Social Networks: The Budgeted Case (Page 642)
Mohammad T. Irfan (Bowdoin College)
Kim Hancock (IBM)
Laura M. Friel (Bowdoin College)

Being Central on the Cheap: Stability in Heterogeneous Multiagent Centrality Games (Page 651)
Gabriel Istrate (West University of Timişoara)
Cosmin Bonchiş (West University of Timişoara)

A Declarative Framework for Maximal k-plex Enumeration Problems (Page 660)
Said Jabbour (CRIL CNRS - Université d'Artois)
Nizar Mhadhbi (INSY2S)
Badran Raddaoui (Télécom SudParis & Institut Polytechnique de Paris)
Lakhdar Sais (CRIL CNRS - Université d'Artois)

(Return to Top)

Lazy-MDPs: Towards Interpretable RL by Learning When to Act (Page 669)
Alexis Jacq (Google Research)
Johan Ferret (Google Research, Inria, & Université de Lille)
Olivier Pietquin (Google Research)
Matthieu Geist (Google Research)

Balancing Fairness and Efficiency in Traffic Routing via Interpolated Traffic Assignment (Page 678)
Devansh Jalota (Stanford University)
Kiril Solovey (Technion - Israel Institute of Technology)
Matthew Tsao (Stanford University)
Stephen Zoepf (Lacuna AI)
Marco Pavone (Stanford University)

Selecting PhD Students and Projects with Limited Funding (Page 687)
Jatin Jindal (Google)
Jérôme Lang (CNRS, PSL)
Katarína Cechlárová (Pavol Jozef Šafárik University)
Julien Lesca (Huawei Technologies)

Optimal Matchings with One-Sided Preferences: Fixed and Cost-Based Quotas (Page 696)
Santhini K. A. (Indian Institute of Technology Madras)
Govind S. Sankar (Duke University)
Meghana Nasre (Indian Institute of Technology Madras)

Planning Not to Talk: Multiagent Systems that are Robust to Communication Loss (Page 705)
Mustafa O. Karabag (The University of Texas at Austin)
Cyrus Neary (The University of Texas at Austin)
Ufuk Topcu (The University of Texas at Austin)

How Hard is Safe Bribery? (Page 714)
Neel Karia (Microsoft Research)
Faraaz Mallick (Indian Institute of Technology, Kharagpur)
Palash Dey (Indian Institute of Technology, Kharagpur)

BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs (Page 723)
Sammie Katt (Northeastern University)
Hai Nguyen (Northeastern University)
Frans A. Oliehoek (Delft University of Technology)
Christopher Amato (Northeastern University)

(Return to Top)

Translating Omega-Regular Specifications to Average Objectives for Model-Free Reinforcement Learning (Page 732)
Milad Kazemi (Newcastle University)
Mateo Perez (University of Colorado Boulder)
Fabio Somenzi (University of Colorado Boulder)
Sadegh Soudjani (Newcastle University)
Ashutosh Trivedi (University of Colorado Boulder)
Alvaro Velasquez (Air Force Research Laboratory)

Tactile Pose Estimation and Policy Learning for Unknown Object Manipulation (Page 742)
Tarik Kelestemur (Northeastern University)
Robert Platt (Northeastern University)
Taskin Padir (Northeastern University)

Disentangling Successor Features for Coordination in Multi-agent Reinforcement Learning (Page 751)
Seung Hyun Kim (University of Illinois at Urbana Champaign)
Neale Van Stralen (University of Illinois at Urbana-Champaign)
Girish Chowdhary (University of Illinois at Urbana-Champaign)
Huy T. Tran (University of Illinois at Urbana-Champaign)

Equilibria in Schelling Games: Computational Hardness and Robustness (Page 761)
Luca Kreisel (TU Berlin)
Niclas Boehmer (TU Berlin)
Vincent Froese (TU Berlin)
Rolf Niedermeier (TU Berlin)

Multimodal Analysis of the Predictability of Hand-gesture Properties (Page 770)
Taras Kucherenko (KTH Royal Institute of Technology)
Rajmund Nagy (KTH Royal Institute of Technology)
Michael Neff (University of California, Davis)
Hedvig Kjellström (KTH Royal Institute of Technology)
Gustav Eje Henter (KTH Royal Institute of Technology)

Towards Pluralistic Value Alignment: Aggregating Value Systems Through lp-Regression (Page 780)
Roger Lera-Leri (IIIA-CSIC)
Filippo Bistaffa (IIIA-CSIC)
Marc Serramia (IIIA-CSIC)
Maite Lopez-Sanchez (Universitat de Barcelona)
Juan Rodriguez-Aguilar (IIIA-CSIC)

Deploying Vaccine Distribution Sites for Improved Accessibility and Equity to Support Pandemic Response (Page 789)
George Z. Li (University of Maryland)
Ann Li (University of Virginia)
Madhav Marathe (University of Virginia)
Aravind Srinivasan (University of Maryland)
Leonidas Tsepenekas (University of Maryland)
Anil Vullikanti (University of Virginia)

(Return to Top)

ASM-PPO: Asynchronous and Scalable Multi-Agent PPO for Cooperative Charging (Page 798)
Yongheng Liang (Sun Yat-sen University)
Hejun Wu (Sun Yat-sen University)
Haitao Wang (Sun Yat-sen University)

Equilibrium Computation For Knockout Tournaments Played By Groups (Page 807)
Grzegorz Lisowski (University of Warwick)
M. S. Ramanujan (University of Warwick)
Paolo Turrini (University of Warwick)

Residual Entropy-based Graph Generative Algorithms (Page 816)
Wencong Liu (Beijing Institute of Technology & Southeast Institute of Information Technology)
Jiamou Liu (The University of Auckland)
Zijian Zhang (Beijing Institute of Technology & Southeast Institute of Information Technology)
Yiwei Liu (Defence Industry Secrecy Examination and Certification Center)
Liehuang Zhu (Beijing Institute of Technology)

The Spoofing Resistance of Frequent Call Markets (Page 825)
Buhong Liu (King's College London)
Maria Polukarov (King's College London)
Carmine Ventre (King's College London)
Lingbo Li (Turing Intelligence Technology)
Leslie Kanthan (Turing Intelligence Technology)
Fan Wu (Turing Intelligence Technology)
Michail Basios (Turing Intelligence Technology)

Logical Theories of Collective Attitudes and the Belief Base Perspective (Page 833)
Emiliano Lorini (IRIT, CNRS, Toulouse University)
Éloan Rapion (ENS Rennes)

Lyapunov Exponents for Diversity in Differentiable Games (Page 842)
Jonathan Lorraine (University of Toronto)
Paul Vicol (University of Toronto)
Jack Parker-Holder (University of Oxford)
Tal Kachman (Radboud University)
Luke Metz (Google Research)
Jakob Foerster (University of Oxford)

Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination (Page 853)
Keane Lucas (Carnegie Mellon University)
Ross E. Allen (Massachusetts Institute of Technology)

(Return to Top)

Coalition Formation Games and Social Ranking Solutions (Page 862)
Roberto Lucchetti (Politecnico di Milano)
Stefano Moretti (Université Paris-Dauphine & Université PSL)
Tommaso Rea (Politecnico di Milano)

On Parameterized Complexity of Binary Networked Public Goods Game (Page 871)
Arnab Maiti (Indian Institute of Technology Kharagpur)
Palash Dey (Indian Institute of Technology Kharagpur)

Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems (Page 880)
Aditya S. Mate (Harvard University)
Arpita Biswas (Harvard University)
Christoph Siebenbrunner (Harvard University)
Susobhan Ghosh (Harvard University)
Milind Tambe (Harvard University)

CAPS: Comprehensible Abstract Policy Summaries for Explaining Reinforcement Learning Agents (Page 889)
Joe McCalmon (Wake Forest University)
Thai Le (The Pennsylvania State University)
Sarra Alqahtani (Wake Forest University)
Dongwon Lee (The Pennsylvania State University)

Warmth and Competence in Human-Agent Cooperation (Page 898)
Kevin R. McKee (DeepMind)
Xuechunzi Bai (Princeton University)
Susan T. Fiske (Princeton University)

Cooperation and Learning Dynamics under Risk Diversity and Financial Incentives (Page 908)
Ramona Merhej (Instituto Superior Tecnico & Sorbonne University)
Fernando P. Santos (University of Amsterdam)
Francisco S. Melo (INESC-ID and Instituto Superior Tecnico, Universidade de Lisboa)
Mohamed Chetouani (Sorbonne University)
Francisco C. Santos (INESC-ID and Instituto Superior Tecnico, Universidade de Lisboa)

Preference-Based Goal Refinement in BDI Agents (Page 917)
Mostafa Mohajeriparizi (University of Amsterdam)
Giovanni Sileno (University of Amsterdam)
Tom van Engers (University of Amsterdam)

(Return to Top)

Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO (Page 926)
Paul Muller (Deepmind)
Mark Rowland (Deepmind)
Romuald Elie (Deepmind)
Georgios Piliouras (Singapore University of Technology and Design)
Julien Perolat (Deepmind)
Mathieu Lauriere (Google Brain)
Raphael Marinier (Google Brain)
Olivier Pietquin (Google Brain)
Karl Tuyls (Deepmind)

A Graph-Based Algorithm for the Automated Justification of Collective Decisions (Page 935)
Oliviero Nardi (University of Amsterdam)
Arthur Boixel (University of Amsterdam)
Ulle Endriss (University of Amsterdam)

Deep Reinforcement Learning for Active Wake Control (Page 944)
Grigory Neustroev (Delft University of Technology)
Sytze P. E. Andringa (Delft University of Technology)
Remco A. Verzijlbergh (Delft University of Technology & Whiffle)
Mathijs M. De Weerdt (Delft University of Technology)

Learning Theory of Mind via Dynamic Traits Attribution (Page 954)
Dung Nguyen (Deakin University)
Phuoc Nguyen (Deakin University)
Hung Le (Deakin University)
Kien Do (Deakin University)
Svetha Venkatesh (Deakin University)
Truyen Tran (Deakin University)

Learning to Transfer Role Assignment Across Team Sizes (Page 963)
Dung Nguyen (Deakin University)
Phuoc Nguyen (Deakin University)
Svetha Venkatesh (Deakin University)
Truyen Tran (Deakin University)

CTRMs: Learning to Construct Cooperative Timed Roadmaps for Multi-agent Path Planning in Continuous Spaces (Page 972)
Keisuke Okumura (Tokyo Institute of Technology)
Ryo Yonetani (OMRON SINIC X)
Mai Nishimura (OMRON SINIC X)
Asako Kanezaki (Tokyo Institute of Technology)

Factorial Agent Markov Model: Modeling Other Agents' Behavior in presence of Dynamic Latent Decision Factors (Page 982)
Liubove Orlov-Savko (Rice University)
Abhinav Jain (Rice University)
Gregory M. Gremillion (CCDC Army Research Lab)
Catherine E. Neubauer (CCDC Army Research Lab)
Jonroy D. Canady (CCDC Army Research Lab)
Vaibhav Unhelkar (Rice University)

(Return to Top)

Networked Restless Multi-Armed Bandits for Mobile Interventions (Page 1001)
Han-Ching Ou (Harvard University)
Christoph Siebenbrunner (Harvard University)
Jackson Killian (Harvard University)
Meredith B. Brooks (Harvard University)
David Kempe (University of Southern California)
Yevgeniy Vorobeychik (University of Washington in St. Louis)
Milind Tambe (Harvard University)

Characterizing Attacks on Deep Reinforcement Learning (Page 1010)
Xinlei Pan (University of California, Berkeley)
Chaowei Xiao (NVIDIA & Arizona State University)
Warren He (University of California, Berkeley)
Shuang Yang (Alibaba)
Jian Peng (University of Illinois at Urbana-Champaign)
Mingjie Sun (Carnegie Mellon University)
Mingyan Liu (University of Michigan, Ann Arbor)
Bo Li (University of Illinois at Urbana-Champaign)
Dawn Song (University of California, Berkeley)

BOID*: Autonomous Goal Deliberation through Abduction (Page 1019)
Stipe Pandžić (Utrecht University)
Jan Broersen (Utrecht University)
Henk Aarts (Utrecht University)

Scaling Mean Field Games by Online Mirror Descent (Page 1028)
Julien Pérolat (DeepMind)
Sarah Perrin (University Lille, CNRS, Inria, Centrale Lille, UMR 9189 CRIStAL)
Romuald Elie (DeepMind)
Mathieu Laurière (Google Research)
Georgios Piliouras (Singapore University of Technology and Design)
Matthieu Geist (Google Research)
Karl Tuyls (DeepMind)
Olivier Pietquin (Google Research)

MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active Learning (Page 1038)
Markus Peschl (Delft University of Technology)
Arkady Zgonnikov (Delft University of Technology)
Frans A. Oliehoek (Delft University of Technology)
Luciano C. Siebert (Delft University of Technology)

Emergent Cooperation from Mutual Acknowledgment Exchange (Page 1047)
Thomy Phan (LMU Munich)
Felix Sommer (LMU Munich)
Philipp Altmann (LMU Munich)
Fabian Ritz (LMU Munich)
Lenz Belzner (Technische Hochschule Ingolstadt)
Claudia Linnhoff-Popien (LMU Munich)

(Return to Top)

Auction-based and Distributed Optimization Approaches for Scheduling Observations in Satellite Constellations with Exclusive Orbit Portions (Page 1056)
Gauthier Picard (ONERA/DTIS, Université de Toulouse)

Trajectory Coordination based on Distributed Constraint Optimization Techniques in Unmanned Air Traffic Management (Page 1065)
Gauthier Picard (ONERA/DTIS, Université de Toulouse)

Learning Heuristics for Combinatorial Assignment by Optimally Solving Subproblems (Page 1074)
Fredrik Präntare (Linköping University)
Herman Appelgren (Linköping University)
Mattias Tiger (Linköping University)
David Bergström (Linköping University)
Fredrik Heintz (Linköping University)

Evaluating the Role of Interactivity on Improving Transparency in Autonomous Agents (Page 1083)
Peizhu Qian (Rice University)
Vaibhav Unhelkar (Rice University)

Revenue and User Traffic Maximization in Mobile Short-Video Advertising (Page 1092)
Dezhi Ran (Peking University)
Weiqiang Zheng (Yale University)
Yunqi Li (Peking University)
Kaigui Bian (Peking University)
Jie Zhang (University of Southampton)
Xiaotie Deng (Peking University)

Automated Configuration and Usage of Strategy Portfolios Mixed-Motive Bargaining (Page 1101)
Bram M. Renting (Leiden University & Delft University of Technology)
Holger H. Hoos (RWTH Aachen & Leiden University)
Catholijn M. Jonker (Delft University of Technology & Leiden University)

Pareto Conditioned Networks (Page 1110)
Mathieu Reymond (Vrije Universiteit Brussel)
Eugenio Bargiacchi (Vrije Universiteit Brussel)
Ann Nowé (Vrije Universiteit Brussel)

(Return to Top)

Testing Requirements via User and System Stories in Agent Systems (Page 1119)
Sebastian Rodriguez (RMIT University)
John Thangarajah (RMIT University)
Michael Winikoff (Victoria University of Wellington)
Dhirendra Singh (RMIT University)

GCS: Graph-Based Coordination Strategy for Multi-Agent Reinforcement Learning (Page 1128)
Jingqing Ruan (Institute of Automation, Chinese Academy of Sciences & University of Chinese Academy of Sciences)
Yali Du (King's College London)
Xuantang Xiong (Institute of Automation, Chinese Academy of Sciences)
Dengpeng Xing (Institute of Automation, Chinese Academy of Sciences)
Xiyun Li (Institute of Automation, Chinese Academy of Sciences)
Linghui Meng (Institute of Automation, Chinese Academy of Sciences)
Haifeng Zhang (Institute of Automation, Chinese Academy of Sciences)
Jun Wang (University College London)
Bo Xu (Institute of Automation, Chinese Academy of Sciences)

REMAX: Relational Representation for Multi-Agent Exploration (Page 1137)
Heechang Ryu (Samsung Research)
Hayong Shin (Korea Advanced Institute of Science and Technology)
Jinkyoo Park (Korea Advanced Institute of Science and Technology)

Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration (Page 1146)
Lukas Schäfer (University of Edinburgh)
Filippos Christianos (University of Edinburgh)
Josiah P. Hanna (University of Wisconsin - Madison)
Stefano V. Albrecht (University of Edinburgh)

Group Fairness in Bandits with Biased Feedback (Page 1155)
Candice Schumann (University of Maryland)
Zhi Lang (University of Maryland)
Nicholas Mattei (Tulane University)
John P. Dickerson (University of Maryland)

Sympathy-based Reinforcement Learning Agents (Page 1164)
Manisha Senadeera (Deakin University)
Thommen George Karimpanal (Deakin University)
Sunil Gupta (Deakin University)
Santu Rana (Deakin University)

(Return to Top)

Learning Efficient Diverse Communication for Cooperative Heterogeneous Teaming (Page 1173)
Esmaeil Seraj (Georgia Institute of Technology)
Zheyuan Wang (Georgia Institute of Technology)
Rohan Paleja (Georgia Institute of Technology)
Daniel Martin (Georgia Institute of Technology)
Matthew Sklar (Georgia Institute of Technology)
Anirudh Patel (Sandia National Laboratory)
Matthew Gombolay (Georgia Institute of Technology)

Using Deep Learning to Bootstrap Abstractions for Hierarchical Robot Planning (Page 1183)
Naman Shah (Arizona State University)
Siddharth Srivastava (Arizona State University)

ACuTE: Automatic Curriculum Transfer from Simple to Complex Environments (Page 1192)
Yash Shukla (Tufts University)
Christopher Thierauf (Tufts University)
Ramtin Hosseini (Tufts University)
Gyan Tatiya (Tufts University)
Jivko Sinapov (Tufts University)

Anti-Malware Sandbox Games (Page 1201)
Sujoy Sikdar (Binghamton University)
Sikai Ruan (Rensselaer Polytechnic Institute)
Qishen Han (Rensselaer Polytechnic Institute)
Paween Pitimanaaree (SCB Securities Co. Ltd.)
Jeremy Blackthorne (Boston Cybernetics Institute)
Bulent Yener (Rensselaer Polytechnic Institute)
Lirong Xia (Rensselaer Polytechnic Institute)

Properties of Reputation Lag Attack Strategies (Page 1210)
S. Sirur (University of Oxford)
Tim Muller (University of Nottingham)

The Generalized Magician Problem under Unknown Distributions and Related Applications (Page 1219)
Aravind Srinivasan (University of Maryland, College Park)
Pan Xu (New Jersey Institute of Technology)

Context-Aware Modelling for Multi-Robot Systems Under Uncertainty (Page 1228)
Charlie Street (University of Oxford)
Bruno Lacerda (University of Oxford)
Michal Staniaszek (University of Oxford)
Manuel Mühlig (Honda Research Institute Europe GmbH)
Nick Hawes (University of Oxford)

(Return to Top)

Off-Policy Evolutionary Reinforcement Learning with Maximum Mutations (Page 1237)
Karush Suri (University of Toronto)

Justifying Social-Choice Mechanism Outcome for Improving Participant Satisfaction (Page 1246)
Sharadhi Alape Suryanarayana (Bar-Ilan University)
David Sarne (Bar-Ilan University)
Sarit Kraus (Bar-Ilan University)

Descriptive and Prescriptive Visual Guidance to Improve Shared Situational Awareness in Human-Robot Teaming (Page 1256)
Aaquib Tabrez (University of Colorado Boulder)
Matthew B. Luebbers (University of Colorado Boulder)
Bradley Hayes (University of Colorado Boulder)

How Hard is Bribery in Elections with Randomly Selected Voters (Page 1265)
Liangde Tao (Zhejiang University)
Lin Chen (Texas Tech University)
Lei Xu (University of Texas Rio Grande Valley)
Weidong Shi (University of Houston)
Ahmed Sunny (Texas Tech University)
Md Mahabub Uz Zaman (Texas Tech University)

Socially Supervised Representation Learning: The Role of Subjectivity in Learning Efficient Representations (Page 1274)
Julius Taylor (Inria & Université de Bordeaux)
Eleni Nisioti (Inria & Université de Bordeaux)
Clément Moulin-Frier (Inria & Université de Bordeaux)

Corruption in Auctions: Social Welfare Loss in Hybrid Multi-Unit Auctions (Page 1283)
Andries van Beek (Tilburg University)
Ruben Brokkelkamp (Centrum Wiskunde & Informatica)
Guido Schäfer (Centrum Wiskunde, Informatica ILLC, & University of Amsterdam)

Coaching Agent: Making Recommendations for Behavior Change. A Case Study on Improving Eating Habits (Page 1292)
Jules Vandeputte (UMR MIA-Paris, AgroParisTech, INRAe, Université Paris-Saclay)
Antoine Cornuéjols (UMR MIA-Paris, AgroParisTech, INRAe, Université Paris-Saclay)
Nicolas Darcel (UMR PNCA, AgroParisTech, INRAe, Université Paris-Saclay)
Fabien Delaere (Danone Nutricia Research)
Christine Martin (UMR MIA-Paris, AgroParisTech, INRAe, Université Paris-Saclay)

(Return to Top)

How to Sense the World: Leveraging Hierarchy in Multimodal Perception for Robust Reinforcement Learning Agents (Page 1301)
Miguel Vasco (INESC-ID & Universidade de Lisboa)
Hang Yin (KTH Royal Institute of Technology)
Francisco S. Melo (INESC-ID & Universidade de Lisboa)
Ana Paiva (INESC-ID & Universidade de Lisboa)

Controller Synthesis for Omega-Regular and Steady-State Specifications (Page 1310)
Alvaro Velasquez (Air Force Research Laboratory)
Ismail Alkhouri (University of Central Florida)
Andre Beckus (Air Force Research Laboratory)
Ashutosh Trivedi (University of Colorado Boulder)
George Atia (University of Central Florida)

Graphical Representation Enhances Human Compliance with Principles for Graded Argumentation Semantics (Page 1319)
Srdjan Vesic (CNRS, Université d'Artois, CRIL)
Bruno Yun (University of Aberdeen)
Predrag Teovanovic (University of Belgrade)

Epistemic Reasoning in Jason (Page 1328)
Michael Vezina (Carleton University)
Babak Esfandiari (Carleton University)

Robust Learning from Observation with Model Misspecification (Page 1337)
Luca Viano (LIONS, EPFL)
Yu-Ting Huang (EPFL)
Parameswaran Kamalaruban (The Alan Turing Institute)
Craig Innes (The University of Edinburgh)
Subramanian Ramamoorthy (The University of Edinburgh)
Adrian Weller (University of Cambridge & The Alan Turing Institute)

Evaluating Strategy Exploration in Empirical Game-Theoretic Analysis (Page 1346)
Yongzhao Wang (University of Michigan)
Qiurui Ma (Harvard University)
Michael P. Wellman (University of Michigan)

FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent Systems (Page 1355)
Yutong Wang (National University of Singapore)
Guillaume Sartoretti (National University of Singapore)

(Return to Top)

Online Collective Multiagent Planning by Offline Policy Reuse with Applications to City-Scale Mobility-on-Demand Systems (Page 1364)
Wanyuan Wang (Southeast University)
Gerong Wu (Southeast University)
Weiwei Wu (Southeast University)
Yichuan Jiang (Southeast University)
Bo An (Nanyang Technological University)

Position-Based Matching with Multi-Modal Preferences (Page 1373)
Yinghui Wen (Shandong University)
Aizhong Zhou (Ocean University of China)
Jiong Guo (Shandong University)

Empirical Estimates on Hand Manipulation are Recoverable: A Step Towards Individualized and Explainable Robotic Support in Everyday Activities (Page 1382)
Alexander Wich (University of Bremen)
Holger Schultheis (University of Bremen)
Michael Beetz (University of Bremen)

Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning (Page 1391)
Baicen Xiao (University of Washington)
Bhaskar Ramasubramanian (Western Washington University)
Radha Poovendran (University of Washington)

SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning (Page 1400)
Zhiwei Xu (Institute of Automation, Chinese Academy of Sciences & University of Chinese Academy of Sciences)
Yunpeng Bai (Institute of Automation, Chinese Academy of Sciences & University of Chinese Academy of Sciences)
Dapeng Li (Institute of Automation, Chinese Academy of Sciences & University of Chinese Academy of Sciences)
Bin Zhang (Institute of Automation, Chinese Academy of Sciences & University of Chinese Academy of Sciences)
Guoliang Fan (Institute of Automation, Chinese Academy of Sciences & University of Chinese Academy of Sciences)

Spiking Pitch Black: Poisoning an Unknown Environment to Attack Unknown Reinforcement Learners (Page 1409)
Hang Xu (Nanyang Technological University)
Xinghua Qu (ByteDance AI Lab)
Zinovi Rabinovich (Nanyang Technological University)

Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning (Page 1418)
Wanqi Xue (Nanyang Technological University)
Wei Qiu (Nanyang Technological University)
Bo An (Nanyang Technological University)
Zinovi Rabinovich (Nanyang Technological University)
Svetlana Obraztsova (Nanyang Technological University)
Chai Kiat Yeo (Nanyang Technological University)

(Return to Top)

Standby-Based Deadlock Avoidance Method for Multi-Agent Pickup and Delivery Tasks (Page 1427)
Tomoki Yamauchi (Waseda University)
Yuki Miyashita (Waseda University)
Toshiharu Sugawara (Waseda University)

Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning (Page 1436)
Jiachen Yang (Georgia Institute of Technology)
Ethan Wang (Georgia Institute of Technology)
Rakshit Trivedi (Harvard University)
Tuo Zhao (Georgia Institute of Technology)
Hongyuan Zha (Chinese University of Hong Kong, Shenzhen)

Strategy-Proof House Allocation with Existing Tenants over Social Networks (Page 1446)
Bo You (Kyushu University)
Ludwig Dierks (Kyushu University & University of Zurich)
Taiki Todo (Kyushu University)
Minming Li (City University of Hong Kong)
Makoto Yokoo (Kyushu University)

Segregation in Social Networks of Heterogeneous Agents Acting under Incomplete Information (Page 1455)
D. Kai Zhang (Imperial College London)
Alexander Carver (Imperial College London)

Multi-Agent Path Finding for Precedence-Constrained Goal Sequences (Page 1464)
Han Zhang (University of Southern California)
Jingkai Chen (Massachusetts Institute of Technology)
Jiaoyang Li (University of Southern California)
Brian C. Williams (Massachusetts Institute of Technology)
Sven Koenig (University of Southern California)

The Competition and Inefficiency in Urban Road Last-Mile Delivery (Page 1473)
Keyang Zhang (Imperial College London)
Jose Javier Escribano Macias (Imperial College London)
Dario Paccagnan (Imperial College London)
Panagiotis Angeloudis (Imperial College London)

(Return to Top)

Tracking Truth by Weighting Proxies in Liquid Democracy (Page 1482)
Yuzhe Zhang (University of Groningen)
Davide Grossi (University of Groningen & University of Amsterdam)

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms (Page 1491)
Shangtong Zhang (University of Oxford)
Romain Laroche (Microsoft Research Montreal)
Harm van Seijen (Microsoft Research Montreal)
Shimon Whiteson (University of Oxford)
Remi Tachet des Combes (Microsoft Research Montreal)

Centralized Model and Exploration Policy for Multi-Agent RL (Page 1500)
Qizhen Zhang (University of Toronto & Vector Institute)
Chris Lu (University of Oxford)
Animesh Garg (University of Toronto, Vector Institute, & NVIDIA)
Jakob Foerster (University of Oxford)

Incentives to Invite Others to Form Larger Coalitions (Page 1509)
Yao Zhang (ShanghaiTech University)
Dengji Zhao (ShanghaiTech University)

Extended Abstracts

R-CHECK: A Model Checker for Verifying Reconfigurable MAS (Page 1518)
Yehia Abd Alrahman (University of Gothenburg)
Shaun Azzopardi (University of Gothenburg)
Nir Piterman (University of Gothenburg)

RASS: Risk-Aware Swarm Storage (Page 1521)
Samuel Arseneault (Polytechnique Montréal)
David Vielfaure (Polytechnique Montréal)
Giovanni Beltrame (Polytechnique Montréal)

Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning (Page 1524)
Raphaël Avalos (Vrije Universiteit Brussel)
Mathieu Reymond (Vrije Universiteit Brussel)
Ann Nowé (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel (BE) & HU University of Applied Science Utrecht (NL))

(Return to Top)

Advising Agent for Service-Providing Live-Chat Operators (Page 1527)
Aviram Aviv (Bar Ilan University)
Yaniv Oshrat (Bar Ilan University)
Samuel Assefa (US Bank AI Innovation)
Toby Mustapha (J.P. Morgan AI Research)
Daniel Borrajo (J.P. Morgan AI Research)
Manuela Veloso (J.P. Morgan AI Research)
Sarit Kraus (Bar Ilan University)

Status-quo Policy Gradient in Multi-Agent Reinforcement Learning (Page 1530)
Pinkesh Badjatiya (Microsoft)
Mausoom Sarkar (Adobe)
Nikaash Puri (Adobe)
Jayakumar Subramanian (Adobe)
Abhishek Sinha (Waymo)
Siddharth Singh (University of Maryland)
Balaji Krishnamurthy (Adobe)

Deep Learnable Strategy Templates for Multi-Issue Bilateral Negotiation (Page 1533)
Pallavi Bagga (Royal Holloway, University of London)
Nicola Paoletti (Royal Holloway, University of London)
Kostas Stathis (Royal Holloway, University of London)

Can Algorithms be Explained Without Compromising Efficiency? The Benefits of Detection and Imitation in Strategic Classification (Page 1536)
Flavia Barsotti (ING Analytics & University of Amsterdam)
Rüya Gökhan Koçer (ING Analytics)
Fernando P. Santos (University of Amsterdam)

A New Porous Structure for Modular Robots (Page 1539)
Jad Bassil (University of Bourgogne Franche-Comté)
Benoît Piranda (University of Bourgogne Franche-Comté)
Abdallah Makhoul (University of Bourgogne Franche-Comté)
Julien Bourgeois (University of Bourgogne Franche-Comté)

On the Average-Case Complexity of Predicting Round-Robin Tournaments (Page 1542)
Dorothea Baumeister (Heinrich-Heine-Universität Düsseldorf)
Tobias Hogrebe (Heinrich-Heine-Universität Düsseldorf)

(Return to Top)

The Evolutionary Dynamics of Soft-Max Policy Gradient in Multi-Agent Settings (Page 1545)
Martino Bernasconi (Politecnico di Milano)
Federico Cacciamani (Politecnico di Milano)
Simone Fioravanti (Gran Sasso Science Institute)
Nicola Gatti (Politecnico di Milano)
Francesco Trovò (Politecnico di Milano)

A Refined Complexity Analysis of Fair Districting over Graphs (Page 1548)
Niclas Boehmer (TU Berlin)
Tomohiro Koana (TU Berlin)
Rolf Niedermeier (TU Berlin)

Contrastive Explanations for Argumentation-Based Conclusions (Page 1551)
AnneMarie Borg (Utrecht University)
Floris Bex (Utrecht University & Tilburg University)

Voting for Centrality (Page 1554)
Ulrik Brandes (ETH Zürich)
Christian Laußmann (Heinrich-Heine-University Düsseldorf)
Jörg Rothe (Heinrich-Heine-University Düsseldorf)

Solving N-Player Dynamic Routing Games with Congestion: A Mean-Field Approach (Page 1557)
Theophile Cabannes (University of California, Berkeley & Google)
Mathieu Laurière (Google Research)
Julien Perolat (DeepMind)
Raphael Marinier (Google Research)
Sertan Girgin (Google Research)
Sarah Perrin (University Lille, CNRS, Inria, Centrale Lille, UMR 9189 CRIStAL)
Olivier Pietquin (Google Research)
Alexandre M. Bayen (University of California, Berkeley)
Eric Goubault (LIX, CNRS, Ecole Polytechnique, IPP)
Romuald Elie (DeepMind)

On Fair and Efficient Solutions for Budget Apportionment (Page 1560)
Pierre Cardi (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE)
Laurent Gourves (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE)
Julien Lesca (Huawei Technologies)

(Return to Top)

Optimal Local Bayesian Differential Privacy over Markov Chains (Page 1563)
Darshan Chakrabarti (Carnegie Mellon University)
Jie Gao (Rutgers University)
Aditya Saraf (University of Washington)
Grant Schoenebeck (University of Michigan)
Fang-Yi Yu (Harvard University)

Augmented Reality Visualizations using Imitation Learning for Collaborative Warehouse Robots (Page 1566)
Kishan Chandan (SUNY Binghamton)
Jack Albertson (SUNY Binghamton)
Shiqi Zhang (SUNY Binghamton)

Multi-unit Double Auctions: Equilibrium Analysis and Bidding Strategy using DDPG in Smart-grids (Page 1569)
Sanjay Chandlekar (International Institute of Information Technology, Hyderabad)
Easwar Subramanian (TCS Innovation Labs)
Sanjay Bhat (TCS Innovation Labs)
Praveen Paruchuri (International Institute of Information Technology, Hyderabad)
Sujit Gujar (International Institute of Information Technology, Hyderabad)

Multi-agent Covering Option Discovery through Kronecker Product of Factor Graphs (Page 1572)
Jiayu Chen (Purdue University)
Jingdi Chen (The George Washington University)
Tian Lan (The George Washington University)
Vaneet Aggarwal (Purdue University)

Priced Gerrymandering (Page 1575)
Palash Dey (Indian Institute of Technology)

Behavior Exploration and Team Balancing for Heterogeneous Multiagent Coordination (Page 1578)
Gaurav Dixit (Oregon State University)
Kagan Tumer (Oregon State University)

(Return to Top)

Multi-Agent Adversarial Attacks for Multi-Channel Communications (Page 1580)
Juncheng Dong (Duke University)
Suya Wu (Duke University)
Mohammadreza Soltani (Duke University)
Vahid Tarokh (Duke University)

Rawlsian Fairness in Online Bipartite Matching: Two-sided, Group, and Individual (Page 1583)
Seyed A. Esmaeili (University of Maryland, College Park)
Sharmila Duppala (University of Maryland, College Park)
Vedant Nanda (University of Maryland, College Park)
Aravind Srinivasan (University of Maryland, College Park)
John P. Dickerson (University of Maryland, College Park)

Approaching the Overbidding Puzzle in All-Pay Auctions: Explaining Human Behavior through Bayesian Optimization and Equilibrium Learning (Page 1586)
Markus Ewert (Technical University of Munich)
Stefan Heidekrüger (Technical University of Munich)
Martin Bichler (Technical University of Munich)

Safety Shields, an Automated Failure Handling Mechanism for BDI Agents (Page 1589)
Angelo Ferrando (University of Genova)
Rafael C. Cardoso (The University of Manchester)

Beyond Uninformed Search: Improving Branch-and-bound Based Acceleration Algorithms for Belief Propagation via Heuristic Strategies (Page 1592)
Junsong Gao (Chongqing University)
Ziyu Chen (Chongqing University)
Dingding Chen (Chongqing University)
Wenxin Zhang (Chongqing University)

Stable Matching Games (Page 1595)
Felipe Garrido-Lucero (LAMSADE (CNRS, UMR 7243), Université Paris Dauphine)
Rida Laraki (LAMSADE (CNRS, UMR 7243), Université Paris Dauphine & University of Liverpool)

An Anytime Heuristic Algorithm for Allocating Many Teams to Many Tasks (Page 1598)
Athina Georgara (Artificial Intelligence Research Institute (IIIA-CSIC) & & Enzyme Advising Group)
Juan A. Rodríguez-Aguilar (Artificial Intelligence Research Institute (IIIA-CSIC))
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))
Ornella Mich (Fondazione Bruno Kessler (FBK))
Raman Kazhamiakin (Fondazione Bruno Kessler (FBK))
Alessio Palmero Aprosio (Fondazione Bruno Kessler (FBK))
Jean-Christophe Pazzaglia (SAP)

(Return to Top)

Influencing Emergent Self-Assembled Structures in Robotic Collectives Through Traffic Control (Page 1601)
Everardo Gonzalez (Oregon State University)
Lucie Houel (Ecole Polytechnique Fédérale de Lausanne)
Radhika Nagpal (Harvard University)
Melinda Malley (Olin College of Engineering)

Minimizing Robot Navigation Graph for Position-Based Predictability by Humans (Page 1604)
Sriram Gopalakrishnan (Arizona State University)
Subbarao Kambhampati (Arizona State University)

A Graph Neural Network Reasoner for Game Description Language (Page 1607)
Alvaro Gunawan (Auckland University of Technology)
Ji Ruan (Auckland University of Technology)
Xiaowei Huang (University of Liverpool)

Adaptive Aggregation Weight Assignment for Federated Learning: A Deep Reinforcement Learning Approach (Page 1610)
Enwei Guo (South China University of Technology)
Xiumin Wang (South China University of Technology)
Weiwei Wu (Southeast University)

Proof-of-Work as a Stigmergic Consensus Algorithm (Page 1613)
Önder Gürcan (Université Paris-Saclay, CEA, List)

Capacitated Network Design Games on a Generalized Fair Allocation Model (Page 1616)
Tesshu Hanaka (Nagoya University)
Toshiyuki Hirose (KDDI Corporation)
Hirotaka Ono (Nagoya University)

(Return to Top)

Multi-agent Task Allocation for Fruit Picker Team Formation (Page 1618)
Helen Harman (University of Lincoln)
Elizabeth I. Sklar (University of Lincoln)

Decision-Theoretic Planning for the Expected Scalarised Returns (Page 1621)
Conor F. Hayes (National University of Ireland Galway)
Diederik M. Roijers (Vrije Universiteit Brussel & & HU University of Applied Science Utrecht)
Enda Howley (National University of Ireland Galway)
Patrick Mannion (National University of Ireland Galway)

Implementation of Actual Data for Artificial Market Simulation (Page 1624)
Masanori Hirano (The University of Tokyo)
Kiyoshi Izumi (The University of Tokyo)
Hiroki Sakaji (The University of Tokyo)

Intelligent Communication over Realistic Wireless Networks in Multi-Agent Cooperative Games (Page 1627)
Diyi Hu (University of Southern California)
Chi Zhang (University of Southern California)
Viktor Prasanna (University of Southern California)
Bhaskar Krishnamachari (University of Southern California)

Multiagent Q-learning with Sub-Team Coordination (Page 1630)
Wenhan Huang (Shanghai Jiao Tong University)
Kai Li (Shanghai Jiao Tong University)
Kun Shao (Huawei Noah's Ark Lab)
Tianze Zhou (Beijing Institute of Technology)
Jun Luo (Huawei Noah's Ark Lab)
Dongge Wang (EPFL)
Hangyu Mao (Huawei Noah's Ark Lab)
Jianye Hao (Huawei Noah's Ark Lab)
Jun Wang (University College London)
Xiaotie Deng (Peking University)

Guaranteeing Half-Maximin Shares Under Cardinality Constraints (Page 1633)
Halvard Hummel (Norwegian University of Science and Technology)
Magnus Lie Hetland (Norwegian University of Science and Technology)

(Return to Top)

Argumentative Forecasting (Page 1636)
Benjamin Irwin (Imperial College London)
Antonio Rago (Imperial College London)
Francesca Toni (Imperial College London)

Data-driven Agent-based Models for Optimal Evacuation of Large Metropolitan Areas for Improved Disaster Planning (Page 1639)
Kazi Ashik Islam (Biocomplexity Institute and Initiative & University of Virginia)
Madhav Marathe (Biocomplexity Institute and Initiative & University of Virginia)
Henning Mortveit (Biocomplexity Institute and Initiative & University of Virginia)
Samarth Swarup (Biocomplexity Institute and Initiative & University of Virginia)
Anil Vullikanti (Biocomplexity Institute and Initiative & University of Virginia)

Near-Optimal Reviewer Splitting in Two-Phase Paper Reviewing and Conference Experiment Design (Page 1642)
Steven Jecmen (Carnegie Mellon University)
Hanrui Zhang (Carnegie Mellon University)
Ryan Liu (Carnegie Mellon University)
Fei Fang (Carnegie Mellon University)
Vincent Conitzer (Duke University)
Nihar B. Shah (Carnegie Mellon University)

Learning to Advise and Learning from Advice in Cooperative Multiagent Reinforcement Learning (Page 1645)
Yue Jin (Tsinghua University)
Shuangqing Wei (Louisiana State University)
Jian Yuan (Tsinghua University)
Xudong Zhang (Tsinghua University)

REFORM: Reputation Based Fair and Temporal Reward Framework for Crowdsourcing (Page 1648)
Samhita Kanaparthy (International Institute of Information Technology, Hyderabad)
Sankarshan Damle (International Institute of Information Technology, Hyderabad)
Sujit Gujar (International Institute of Information Technology, Hyderabad)

Forgiving Debt in Financial Network Games (Page 1651)
Panagiotis Kanellopoulos (University of Essex)
Maria Kyropoulou (University of Essex)
Hao Zhou (University of Essex)

(Return to Top)

How to Train Your Agent: Active Learning from Human Preferences and Justifications in Safety-critical Environments (Page 1654)
Ilias Kazantzidis (University of Southampton)
Timothy J. Norman (University of Southampton)
Yali Du (King's College London)
Christopher T. Freeman (University of Southampton)

Popularity and Strict Popularity in Altruistic Hedonic Games and Minimum-Based Altruistic Hedonic Games (Page 1657)
Anna Maria Kerkmann (Heinrich-Heine-Universität Düsseldorf)
Jörg Rothe (Heinrich-Heine-Universität Düsseldorf)

Minimizing Expected Intrusion Detection Time in Adversarial Patrolling (Page 1660)
David Klaška (Masaryk University)
Antonín Kučera (Masaryk University)
Vit Musil (Masaryk University)
Vojtěch Řehák (Masaryk University)

Learning Generalizable Multi-Lane Mixed-Autonomy Behaviors in Single Lane Representations of Traffic (Page 1663)
Abdul Rahman Kreidieh (University of California, Berkeley)
Yibo Zhao (University of California, Berkeley)
Samyak Parajuli (University of California, Berkeley)
Alexandre M. Bayen (University of California, Berkeley)

Measuring Resilience in Collective Robotic Algorithms (Page 1666)
Jennifer Leaf (Oregon State University)
Julie A. Adams (Oregon State University)

Automated Story Sifting Using Story Arcs (Page 1669)
Wilkins Leong (RMIT University)
Julie Porteous (RMIT University)
John Thangarajah (RMIT University)

(Return to Top)

Theoretical Models and Preliminary Results for Contact Tracing and Isolation (Page 1672)
George Z. Li (University of Maryland)
Arash Haddadan (University of Virginia)
Ann Li (University of Virginia)
Madhav V. Marathe (University of Virginia)
Aravind Srinivasan (University of Maryland)
Anil Vullikanti (University of Virginia)
Zeyu Zhao (University of Maryland)

Improving Generalization with Cross-State Behavior Matching in Deep Reinforcement Learning (Page 1675)
Guan-Ting Liu (National Taiwan University)
Guan-Yu Lin (National Taiwan University)
Pu-Jen Cheng (National Taiwan University)

(Almost) Envy-Free, Proportional and Efficient Allocations of an Indivisible Mixed Manna (Page 1678)
Vasilis Livanos (University of Illinois at Urbana-Champaign)
Ruta Mehta (University of Illinois at Urbana-Champaign)
Aniket Murhekar (University of Illinois at Urbana-Champaign)

Modeling Affective Reaction in Multi-agent Systems (Page 1681)
Jieting Luo (Zhejiang University)
Mehdi Dastani (Utrecht University)

Multimodal Reinforcement Learning with Effective State Representation Learning (Page 1684)
Jinming Ma (University of Science and Technology of China)
Yingfeng Chen (Netease Fuxi AI Lab)
Feng Wu (University of Science and Technology of China)
Xianpeng Ji (Netease Fuxi AI Lab)
Yu Ding (Netease Fuxi AI Lab)

Group-level Fairness Maximization in Online Bipartite Matching (Page 1687)
Will Ma (Columbia University)
Pan Xu (New Jersey Institute of Technology)
Yifan Xu (Southeast University)

A Simulation Based Online Planning Algorithm for Multi-Agent Cooperative Environments (Page 1690)
Rafid Ameer Mahmud (University of Dhaka)
Fahim Faisal (University of Dhaka)
Saaduddin Mahmud (University of Massachusetts, Amherst)
Md. Mosaddek Khan (University of Dhaka)

(Return to Top)

Parameterized Algorithms for Kidney Exchange (Page 1693)
Arnab Maiti (Indian Institute of Technology Kharagpur)
Palash Dey (Indian Institute of Technology Kharagpur)

Active Generation of Logical Rules for POMCP Shielding (Page 1696)
Giulio Mazzi (Università degli Studi di Verona)
Alberto Castellini (Università degli Studi di Verona)
Alessandro Farinelli (Università degli Studi di Verona)

Reinforcement Learning for Traffic Signal Control Optimization: A Concept for Real-World Implementation (Page 1699)
Henri Meess (Fraunhofer IVI)
Jeremias Gerner (Technische Hochschule Ingolstadt)
Daniel Hein (GEVAS software GmbH)
Stefanie Schmidtner (Technische Hochschule Ingolstadt)
Gordon Elger (Fraunhofer IVI)

Towards Assume-Guarantee Verification of Strategic Ability (Page 1702)
Łukasz Mikulski (Nicolaus Copernicus University & Institute of Computer Science, Polish Academy of Sciences)
Wojciech Jamroga (Institute of Computer Science, Polish Academy of Sciences & University of Luxembourg)
Damian Kurpiewski (Institute of Computer Science Polish Academy of Sciences & Nicolaus Copernicus University)

On Achieving Leximin Fairness and Stability in Many-to-One Matchings (Page 1705)
Shivika Narang (Indian Institute of Science)
Arpita Biswas (Harvard University)
Yadati Narahari (Indian Institute of Science)

Towards an Enthymeme-Based Communication Framework (Page 1708)
Alison R. Panisson (Universidade Federal de Santa Catarina)
Peter McBurney (King's College London)
Rafael H. Bordini (Pontifical Catholic University of Rio Grande do Sul)

(Return to Top)

I Will Have Order! Optimizing Orders for Fair Reviewer Assignment (Page 1711)
Justin Payan (University of Massachusetts, Amherst)
Yair Zick (University of Massachusetts, Amherst)

Concise Representations and Complexity of Combinatorial Assignment Problems (Page 1714)
Fredrik Präntare (Linköping University)
George Osipov (Linköping University)
Leif Eriksson (Linköping University)

A Stit Logic of Responsibility (Page 1717)
Aldo Iván Ramírez Abarca (Utrecht University)
Jan Broersen (Utrecht University)

Behavior vs Appearance: What Type of Adaptations are More Socially Motivated? (Page 1720)
Diogo Rato (INESC-ID & Universidade de Lisboa)
Marta Couto (INESC-ID)
Rui Prada (INESC-ID & Universidade de Lisboa)

Agent-Time Attention for Sparse Rewards Multi-Agent Reinforcement Learning (Page 1723)
Jennifer She (Stanford University)
Jayesh K. Gupta (Microsoft)
Mykel J. Kochenderfer (Stanford University)

Environment Guided Interactive Reinforcement Learning: Learning from Binary Feedback in High-Dimensional Robot Task Environments (Page 1726)
Isaac Sheidlower (Tufts University)
Elaine Schaertl Short (Tufts University)
Allison Moore (Tufts University)

Pre-trained Language Models as Prior Knowledge for Playing Text-based Games (Page 1729)
Ishika Singh (Indian Institute of Technology Kanpur)
Gargi Singh (Indian Institute of Technology Kanpur)
Ashutosh Modi (Indian Institute of Technology Kanpur)

(Return to Top)

Resource-Aware Adaptation of Heterogeneous Strategies for Coalition Formation (Page 1732)
Anusha Srikanthan (University of Pennsylvania)
Harish Ravichandar (Georgia Institute of Technology)

Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators (Page 1735)
Miguel Suau (Delft University of Technology)
Jinke He (Delft University of Technology)
Matthijs T. J. Spaan (Delft University of Technology)
Frans A. Oliehoek (Delft University of Technology)

Maximizing Resource Allocation Likelihood with Minimum Compromise (Page 1738)
Yohai Trabelsi (Bar-Ilan University)
Abhijin Adiga (University of Virginia)
Sarit Kraus (Bar Ilan University)
S. S. Ravi (University of Virginia & University of Albany - SUNY)

Max-sum with Quadtrees for Continuous DCOPs with Application to Lane-Free Autonomous Driving (Page 1741)
Dimitrios Troullinos (Technical University of Crete)
Georgios Chalkiadakis (Technical University of Crete)
Vasilis Samoladas (Technical University of Crete)
Markos Papageorgiou (Technical University of Crete)

Autonomous Flight Arcade Challenge: Single- and Multi-Agent Learning Environments for Aerial Vehicles (Page 1744)
Paul Tylkin (Massachusetts Institute of Technology)
Tsun-Hsuan Wang (Massachusetts Institute of Technology)
Tim Seyde (Massachusetts Institute of Technology)
Kyle Palko (U.S. Air Force Artificial Intelligence Accelerator)
Ross Allen (Massachusetts Institute of Technology)
Alexander Amini (Massachusetts Institute of Technology)
Daniela Rus (Massachusetts Institute of Technology)

Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems (Page 1747)
Christos K. Verginis (University of Texas at Austin)
Zhe Xu (Arizona State University)
Ufuk Topcu (University of Texas at Austin)

Moving Target Defense under Uncertainty for Web Applications (Page 1750)
Vignesh Viswanathan (University of Massachusetts, Amherst)
Megha Bose (International Institute of Information Technology, Hyderabad)
Praveen Paruchuri (International Institute of Information Technology, Hyderabad)

(Return to Top)

The Ethical Acceptability of Artificial Social Agents (Page 1753)
Ravi Vythilingam (Macquarie University)
Deborah Richards (Macquarie University)
Paul Formosa (Macquarie University)

Near On-Policy Experience Sampling in Multi-Objective Reinforcement Learning (Page 1756)
Shang Wang (University of Washington)
Mathieu Reymond (Vrije Universiteit Brussel)
Athirai A. Irissappane (University of Washington)
Diederik M. Roijers (Vrije Universiteit Brussel & HU University of Applied Sciences Utrecht)

On Agent Incentives to Manipulate Human Feedback in Multi-Agent Reward Learning Scenarios (Page 1759)
Francis Rhys Ward (Imperial College London)
Francesca Toni (Imperial College London)
Francesco Belardinelli (Imperial College London)

How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget (Page 1762)
Erik Wijmans (Georgia Institute of Technology & Facebook AI Research)
Irfan Essa (Georgia Institute of Technology & Google Atlanta)
Dhruv Batra (Georgia Institute of Technology & Facebook AI Research)

Performance of Deep Reinforcement Learning for High Frequency Market Making on Actual Tick Data (Page 1765)
Ziyi Xu (Peking University)
Xue Cheng (Peking University)
Yangbo He (Peking University)

On the Complexity of Controlling Amendment and Successive Winners (Page 1768)
Yongjie Yang (Saarland University)

On-the-fly Strategy Adaptation for ad-hoc Agent Coordination (Page 1771)
Jaleh Zand (University of Oxford)
Jack Parker-Holder (University of Oxford)
Stephen J. Roberts (University of Oxford)

(Return to Top)

Off-Policy Correction For Multi-Agent Reinforcement Learning (Page 1774)
Michał Zawalski (University of Warsaw)
Błażej Osiński (University of Warsaw)
Henryk Michalewski (Google Research)
Piotr Miłoś (Polish Academy of Sciences)

An Agent-based Model for Emergency Evacuation from a Multi-floor Building (Page 1777)
Xiaoyan Zhang (Newcastle University)
Graham Coates (Newcastle University)
Sarah Dunn (Newcastle University)
Jean Hall (Newcastle University)

Irrational Behaviour and Globalisation (Page 1780)
Yuanzi Zhu (King's College London)
Carmine Ventre (King's College London)

Blue Sky Ideas Track

Robots Teaching Humans: A New Communication Paradigm via Reverse Teleoperation (Page 1783)
Rika Antonova (Stanford University)
Ankur Handa (NVIDIA)

Social Choice Around the Block: On the Computational Social Choice of Blockchain (Page 1788)
Davide Grossi (University of Groningen & University of Amsterdam)

Augmented Democratic Deliberation: Can Conversational Agents Boost Deliberation in Social Media? (Page 1794)
Rafik Hadfi (Kyoto University)
Takayuki Ito (Kyoto University)

Towards Anomaly Detection in Reinforcement Learning (Page 1799)
Robert Müller (LMU Munich)
Steffen Illium (LMU Munich)
Thomy Phan (LMU Munich)
Tom Haider (Fraunhofer IKS)
Claudia Linnhoff-Popien (LMU Munich)

(Return to Top)

The Holy Grail of Multi-Robot Planning: Learning to Generate Online-Scalable Solutions from Offline-Optimal Experts (Page 1804)
Amanda Prorok (University of Cambridge)
Jan Blumenkamp (University of Cambridge)
Qingbiao Li (University of Cambridge)
Ryan Kortvelesy (University of Cambridge)
Zhe Liu (University of Cambridge)
Ethan Stump (DEVCOM Army Research Laboratory)

"Go to the Children": Rethinking Intelligent Agent Design and Programming in a Developmental Learning Perspective (Page 1809)
Alessandro Ricci (Università di Bologna)

Foundations for Grassroots Democratic Metaverse (Page 1814)
Ehud Shapiro (Weizmann Institute of Science & Columbia University)
Nimrod Talmon (Ben-Gurion University)

Agent-Assisted Life-Long Education and Learning (Page 1819)
Tomas Trescak (Western Sydney University)
Roger Lera-Leri (Artificial Intelligence Research Institute (IIIA-CSIC))
Filippo Bistaffa (Artificial Intelligence Research Institute (IIIA-CSIC))
Juan A. Rodriguez-Aguilar (Artificial Intelligence Research Institute (IIIA-CSIC))

Macro Ethics for Governing Equitable Sociotechnical Systems (Page 1824)
Jessica Woodgate (The University of Bristol)
Nirav Ajmeri (The University of Bristol)

Doctoral Consortium

Exploration and Communication for Partially Observable Collaborative Multi-Agent Reinforcement Learning (Page 1829)
Raphaël Avalos (Vrije Universiteit Brussel)

(Return to Top)

Manipulation of Machine Learning Algoirhtms (Page 1833)
Nicholas Bishop (University of Southampton)

Collaborative Training of Multiple Autonomous Agents (Page 1836)
Filippos Christianos (University of Edinburgh)

Towards Multi-Agent Interactive Reinforcement Learning for Opportunistic Software Composition in Ambient Environments (Page 1839)
Kevin Delcourt (IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3)

Online Learning against Strategic Adversary (Page 1841)
Le Cong Dinh (University of Southampton)

Non-Cooperative Multi-Robot Planning Under Shared Resources (Page 1843)
Anna Gautier (University of Oxford)

Incentive Design for Equitable Resource Allocation: Artificial Currencies and Allocation Constraints (Page 1846)
Devansh Jalota (Stanford University)

(Return to Top)

Model-free and Model-based Reinforcement Learning, the Intersection of Learning and Planning (Page 1849)
Piotr Januszewski (Gdańsk University of Technology)

Data-driven Approaches for Formal Synthesis of Dynamical Systems (Page 1852)
Milad Kazemi (Newcastle University)

Budget Feasible Mechanisms in Auction Markets: Truthfulness, Diffusion and Fairness (Page 1854)
Xiang Liu (Southeast University)

Fair Allocation Problems in Reviewer Assignment (Page 1857)
Justin Payan (University of Massachusetts, Amherst)

Designing Mechanisms for Participatory Budgeting (Page 1860)
Simon Rey (University of Amsterdam)

Task Generalisation in Multi-Agent Reinforcement Learning (Page 1863)
Lukas Schäfer (University of Edinburgh)

Empathetic Reinforcement Learning Agents (Page 1866)
Manisha Senadeera (Deakin University)

(Return to Top)

Embodied Team Intelligence in Multi-Robot Systems (Page 1869)
Esmaeil Seraj (Georgia Institute of Technology)

The Reputation Lag Attack (Page 1872)
Sean Sirur (University of Oxford)

Using Multi-objective Optimization to Generate Timely Responsive BDI Agents (Page 1875)
Márcio Fernando Stabile Junior (Universidade de São Paulo)

Engineering Normative and Cognitive Agents with Emotions and Values (Page 1878)
Sz-Ting Tzeng (North Carolina State University)

The Coaching Scenario: Recommender Systems with a Long Term Goal. A Case Study in Changing Dietary Habits (Page 1881)
Jules Vandeputte (UMR MIA-Paris, AgroParisTech, INRAe, Université Paris-Saclay)

Transferable Environment Poisoning: Training-time Attack on Reinforcement Learner with Limited Prior Knowledge (Page 1884)
Hang Xu (Nanyang Technological University)

Demonstration Track

(Return to Top)

Chameleon - A Framework for Developing Conversational Agents for Medical Training Purposes (Page 1887)
Al-Hussein Abutaleb (University of Aberdeen)
Bruno Yun (University of Aberdeen)

An Agent-Based Simulator for Maritime Transport Decarbonisation (Page 1890)
Jan Buermann (University of Southampton)
Dimitar Georgiev (University of Southampton)
Enrico H. Gerding (University of Southampton)
Lewis Hill (University of Southampton)
Obaid Malik (University of Southampton)
Alexandru Pop (University of Southampton)
Matthew Pun (Shell Shipping & Maritime)
Sarvapali D. Ramchurn (University of Southampton)
Elliot Salisbury (University of Southampton)
Ivan Stojanovic (Shell Shipping & Maritime)

AdLeap-MAS: An Open-source Multi-Agent Simulator for Ad-hoc Reasoning (Page 1893)
Matheus Aparecido do Carmo Alves (Lancaster University)
Amokh Varma (Indian Institute of Technology)
Yehia Elkhatib (University of Glasgow)
Leandro Soriano Marcolino (Lancaster University)

KnowLedger - A Multi-Agent System Blockchain for Smart Cities Data (Page 1896)
Bruno Fernandes (University of Minho)
André Diogo (University of Minho)
Fábio Silva (Polytechnic Institute of Porto)
José Neves (University of Minho)
Cesar Analide (University of Minho)

A Multi-Agent System for Automated Machine Learning (Page 1899)
Bruno Fernandes (University of Minho)
Paulo Novais (University of Minho)
Cesar Analide (University of Minho)

Demonstrating the Rapid Integration & Development Environment (RIDE): Embodied Conversational Agent (ECA) and Multiagent Capabilities (Page 1902)
Arno Hartholt (University of Southern California Institute for Creative Technologies)
Ed Fast (University of Southern California Institute for Creative Technologies)
Andrew Leeds (University of Southern California Institute for Creative Technologies)
Kevin Kim (University of Southern California Institute for Creative Technologies)
Andrew Gordon (University of Southern California Institute for Creative Technologies)
Kyle McCullough (University of Southern California Institute for Creative Technologies)
Volkan Ustun (University of Southern California Institute for Creative Technologies)
Sharon Mozgai (University of Southern California Institute for Creative Technologies)

SIERRA: A Modular Framework for Research Automation (Page 1905)
John Harwell (University of Minnesota)
London Lowmanstone (University of Minnesota)
Maria Gini (University of Minnesota)

(Return to Top)

Cellulan World: Interactive Platform to Learn Swarm Behaviors (Page 1908)
Hala Khodr (Swiss Federal Institute of Technology (EPFL))
Barbara Bruno (Swiss Federal Institute of Technology (EPFL))
Aditi Kothiyal (Swiss Federal Institute of Technology (EPFL))
Pierre Dillenbourg (Swiss Federal Institute of Technology (EPFL))

Ev-IDID: Enhancing Solutions to Interactive Dynamic Influence Diagrams through Evolutionary Algorithms (Page 1911)
Biyang Ma (Minnan Normal University)
Yinghui Pan (Shenzhen University)
Yifeng Zeng (Northumbria University)
Zhong Ming (Shenzhen University)

LBfT: Learning Bayesian Network Structures from Text in Autonomous Typhoon Response Systems (Page 1914)
Yinghui Pan (Shenzhen University)
Junhan Chen (Xiamen University)
Yifeng Zeng (Northumbria University)
Zhangrui Yao (Xiamen University)
Qianwen Li (Shenzhen University)
Biyang Ma (Northumbria University)
Yi Ji (Shenzhen University)
Zhong Ming (Shenzhen University)

JEDAI: A System for Skill-Aligned Explainable Robot Planning (Page 1917)
Naman Shah (Arizona State University)
Pulkit Verma (Arizona State University)
Trevor Angle (Arizona State University)
Siddharth Srivastava (Arizona State University)

JAAMAS Track

Reaching Consensus Under a Deadline (Page 1920)
Marina Bánnikova (Universidad Autónoma de Barcelona)
Lihi Dery (Ariel University)
Svetlana Obraztsova (Nanyang Technological University)
Zinovi Rabinovich (Nanyang Technological University)
Jeffrey S. Rosenschein (The Hebrew University of Jerusalem)

Goal-Driven Active Learning (Page 1923)
Nicolas Bougie (The Graduate University for Advanced Studies (Sokendai) & National Institute of Informatics)
Ryutaro Ichise (National Institute of Informatics & The Graduate University for Advanced Studies (Sokendai))

Combining Quantitative and Qualitative Reasoning in Concurrent Multi-player Games (Page 1926)
Nils Bulling (Clausthal University of Technology)
Valentin Goranko (Stockholm University)

(Return to Top)

Voting with Random Classifiers (VORACE): Theoretical and Experimental Analysis (Page 1929)
Cristina Cornelio (Samsung AI)
Michele Donini (Amazon)
Andrea Loreggia (University of Brescia)
Maria Silvia Pini (University of Padova)
Francesca Rossi (IBM Research)

Enabling BDI Group Plans with Coordination Middleware: Semantics and Implementation (Page 1932)
Stephen Cranefield (University of Otago)

GDL as a Unifying Domain Description Language for Declarative Automated Negotiation (Page 1935)
Dave de Jonge (IIIA-CSIC)
Dongmo Zhang (Western Sydney University)

Designing Efficient and Fair Mechanisms for Multi-Type Resource Allocation (Page 1938)
Xiaoxi Guo (Peking University)
Sujoy Sikdar (Binghamton University)
Haibin Wang (Peking University)
Lirong Xia (Rensselaer Polytechnic Institute)
Yongzhi Cao (Peking University)
Hanpin Wang (Guangzhou University & Peking University)

Automatic Calibration Framework of Agent-based Models for Dynamic and Heterogeneous Parameters (Page 1941)
Dongjun Kim (Korea Advanced Institute of Science and Technology)
Tae-Sub Yun (Korea Advanced Institute of Science and Technology)
Il-Chul Moon (Korea Advanced Institute of Science and Technology)
Jang Won Bae (Korea University of Technology and Education)

Trust Repair in Human-Agent Teams: The Effectiveness of Explanations and Expressing Regret (Page 1944)
E.S. Kox (TNO)
J.H. Kerstholt (TNO)
T.F. Hueting (TNO)
P.W. de Vries (University of Twente)

(Return to Top)

Concurrent Negotiations with Global Utility Functions (Page 1947)
Yasser Mohammad (NEC Corporation & National Institute of Advanced Industrial Science and Technology)
Shinji Nakadai (NEC Corporation & National Institute of Advanced Industrial Science and Technology)

Towards Addressing Dynamic Multi-agent Task Allocation in Law Enforcement (Page 1950)
Itshak Tkach (London University)
Sofia Amador Nelke (Holon Institute of Technology)