| Wednesday May 15 (17:15-18:15) |
| Title |
Authors |
| Approximation Gradient Error Variance Reduced Optimization |
Weiye Zhao |
| Credulous Acceptability, Poison Games and Modal Logic |
Davide Grossi, Simon Rey |
| Social Mobilization to Reposition Indiscriminately Parked Shareable Bikes |
Zelei Liu, Han Yu, Leye Wang, Liang Hu, Qiang Yang |
| A Regulation Enforcement Solution for Multi-agent Reinforcement Learning |
Sun Fan-Yun, Yen-Yu Chang, Yueh-Hua Wu, Shou-De Lin |
| Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards
Sophisticated Opponents |
Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan Zheng |
| Multi-agent Path Planning with Non-constant Velocity Motion |
Ngai Meng Kou, Cheng Peng, Xiaowei Yan, Zhiyuan Yang, Heng Liu, Kai Zhou,
Haibing Zhao, Lijun Zhu, Yinghui Xu |
| Complexity and Approximations in Robust Coalition Formation via Max-Min
k-Partitioning |
Anisse Ismaili, Makoto Yokoo, Noam Hazon, Sarit Kraus, Emi Watanabe |
| Contradict The Machine: a Hybrid Approach to Identifying Unknown Unknowns |
Colin Vandenhof, Edith Law |
| Invincible Strategies of Iterated Prisoner's Dilemma |
Shiheng Wang, Fangzhen Lin |
| General-Sum Cyber Deception Games under Partial Attacker Valuation
Information |
Omkar Thakoor, Phebe Vayanos, Christopher Kiekintveld, Milind Tambe,
Haifeng Xu |
| Optimising Worlds to Evaluate and Influence Reinforcement Learning Agents |
Richard Everett, Adam Cobb, Stephen Roberts, Andrew Markham |
| Broken Signals in Security Games: Coordinating Patrollers and Sensors in
the Real World |
Elizabeth Bondi, Hoon Oh, Haifeng Xu, Fei Fang, Bistra Dilkina, Milind
Tambe |
| Probabilistic resource-bounded alternating-time temporal logic |
Hoang Nga Nguyen, Abdur Rakib |
| A Polynomial-time Fragment of Epistemic Probabilistic Argumentation |
Nico Potyka |
| Bayesian-DPOP for continuous Distributed Constraint Optimization Problems |
Jeroen Fransman, Bart De Schutter, Henry Dol, Erik Theunissen, Joris Sijs |
| Distributed Policy Iteration for Scalable Approximation of Cooperative
Multi-Agent Policies |
Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor, Sebastian Feld,
Claudia Linnhoff-Popien |
| Avoiding Social Disappointment in Elections |
Mohammad Ali Javidian, Pooyan Jamshidi, Rasoul Ramezanian |
| Landmark Based Reward Shaping in Reinforcement Learning with Hidden
States |
Alper Demir, Erkin Çilden, Faruk Polat |
| Cooperating in Long-term Relationships with Time-Varying Structure |
Jacob Crandall, Huy Pham |
| Dynamic and intelligent control of autonomous vehicles for highway
on-ramp merge |
Zine el abidine Kherroubi, Samir Aknine, Rebiha Bacha |
| MCTS-based Automated Negotiation Agent |
Cédric Buron, Zahia Guessoum, Sylvain Ductor |
| The Complexity of Additive Committee Selection with Outliers |
Yongjie Yang, Jianxin Wang |
| Oblivious Envy-Free Allocations of Indivisible Goods |
Hau Chan, Jing Chen, Bo Li, Xiaowei Wu |
| Advice Replay Approach for Richer Knowledge Transfer in Teacher Student
Framework |
Vaibhav Gupta, Daksh Anand, Praveen Paruchuri, Balaraman Ravindran |
| Proportional Representation in Elections: STV vs PAV |
Piotr Faliszewski, Piotr Skowron, Stanislaw Szufa, Nimrod Talmon |
| Simple Contest Enhancers |
Michal Habani, Priel Levy, David Sarne |
| Policy Networks: A Framework for Scalable Integration of Multiple
Decision-Making Models |
Kyle Wray, Shlomo Zilberstein |
| Multiagent Monte Carlo Tree Search |
Nicholas Zerbel, Logan Yliniemi |
| Using surrogate models to calibrate agent-based model parameters under
data scarcity |
Priscilla Avegliano, Jaime Sichman |
| Learning Simulation-Based Games from Data |
Enrique Areyan Viqueira, Cyrus Cousins, Amy Greenwald, Eli Upfal |
| Attention-based Deep Reinforcement Learning for Multi-view Environments |
Elaheh Barati, Xuewen Chen |
| Generating an Agent Taxonomy Using Topological Data Analysis |
Samarth Swarup, Reza Rezazadegan |
| Warning Time: Optimizing Strategic Signaling for Security Against
Boundedly Rational Adversaries |
Sarah Cooney, Phebe Vayanos, Thanh Nguyen, Cleotilde Gonzalez, Christian
Lebiere, Edward Cranford, Milind Tambe |
| Coordination Structures Generated by Deep Reinforcement Learning in
Distributed Task Executions |
Yuki Miyashita, Toshiharu Sugawara |
| Memory Based Multiagent One Shot Learning |
Shauharda Khadka, Connor Yates, Kagan Tumer |
| Obvious Strategyproofness, Bounded Rationality and Approximation |
Diodato Ferraioli, Carmine Ventre |
| An Optimal Rewiring Strategy for Cooperative Multiagent Social Learning |
Hongyao Tang, Jianye Hao, Li Wang, Zan Wang, Tim Baarslag |
| Improving Wind Power Forecasting through Cooperation: A Case-Study on
Operating Farms |
Tanguy Esteoule, Carole Bernon, Marie-Pierre Gleizes, Morgane Barthod |
| Evaluation of Optimization for Pedestrian Route Guidance in Real-world
Crowded Scene |
Shusuke Shigenaka, Shunki Takami, Masaki Onishi, Itsuki Noda, Tomohisa
Yamashita |
| Cooperative Routing with Heterogeneous Vehicles |
Keisuke Otaki, Satoshi Koide, Ayano Okoso, Tomoki Nishi |
| Distributed Task Assignment and Path Planning with Limited Communication
for Robot Teams |
Dario Albani, Wolfgang Hoenig, Nora Ayanian, Daniele Nardi, Vito Trianni |
| How to get the most from goods donated to charities |
Christopher Culley, Ji Qi, Carmine Ventre |
| Actor-Critic Algorithms for Constrained Multi-agent Reinforcement
Learning |
Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda, Prabuchandran
Krithivasan Jayachandran, Shalabh Bhatnagar |
| Thompson Sampling Based Multi-Armed-Bandit Mechanism Using Neural
Networks |
Manisha Padala, Sujit Gujar |
| Computing Stable Solutions in Threshold Network Flow Games With Bounded
Treewidth |
Aldo Pacchiano, Yoram Bachrach |
| Hybrid BiLSTM-Siamese Network for Relation Extraction |
Zeyuan Cui, Shijun Liu |
| Efficient City-Scale Patrolling Using Decomposition and Grafting |
Wanyuan Wang, Zichen Dong, Bo An, Yichuan Jiang |
| Risk Averse Reinforcement Learning for Mixed Multi-agent Environments |
Sai Koti Reddy Danda, Amrita Saha, Srikanth Tamilselvam, Priyanka
Agrawal, Pankaj Dayama |
| From Hotelling to Load Balancing: Approximation and the Principle of
Minimum Differentiation |
Matthias Feldotto, Pascal Lenzner, Louise Molitor, Alexander Skopalik |
| Online Motion Concept Learning: A novel algorithm for sample-efficient
learning and recognition of human actions |
Miguel Vasco, Francisco Melo, David Martins de Matos, Ana Paiva,
Tetsunari Inamura |
| Delayed and Time-Variant Patrolling Strategies against Attackers with
Local Observation Capabilities |
Carlos Diaz Alvarenga, Nicola Basilico, Stefano Carpin |
| Deriving norms from actions, values and context |
Myrthe Tielman, Catholijn Jonker, M. Birna van Riemsdijk |
| Rethinking the Neutrality Axiom in Judgment Aggregation |
Zoi Terzopoulou, Ulle Endriss |
| Explaining Failures Propagations in the Execution of Multi-Agent Temporal
Plans |
Gianluca Torta, Roberto Micalizio, Samuele Sormano |
| Logically-Constrained Neural Fitted Q-iteration |
Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening |
| A Homophily-Free Community Detection Framework for Trajectories with
Delayed Responses |
Chung-Kyun Han, Shih-Fen Cheng, Pradeep Varakantham |
| Stability of Human-Inspired Agent Societies |
Joe Collenette, Katie Atkinson, Daan Bloembergen, Karl Tuyls |
| Deep Generative and Discriminative Domain Adaptation |
Han Zhao |
| Predictive Execution Monitoring of BDI Recipes |
Mika Barkan, Gal Kaminka |
| Aggregating Citizen Preferences for Public Projects Through Civic
Crowdfunding |
Sankarshan Damle, Moin Hussain Moti, Praphul Chandra, Sujit Gujar |
| The Gift Exchange Game: Managing Opponent Actions |
Steven Damer, Maria Gini, Jeffrey Rosenschein |
| DeepAggregation: A New Approach for Aggregating Incomplete Ranked Lists
using Multi-Layer Graph Embedding |
Rohith D Vallam, Ramasuri Narayanam, Srikanth Tamilselvam, Nicholas
Mattei, Sudhanshu Singh, Shweta Garg, Gyana Parija |
| A Social Choice Perspective on Database Aggregation |
Francesco Belardinelli, Umberto Grandi |
| A Privacy Preserving Multiagent System for Load Balancing in the Smart
Grid |
Shangyu Xie, Yuan Hong, Peng-Jun Wan |
| Collaborative Reinforcement Learning Model for Sustainability of
Cooperation in Sequential Social Dilemmas |
Ritwik Chaudhuri, Kushal Mukherjee, Rohith D Vallam, Ayush Kumar,
Antriksh Mathur, Shweta Garg, Sudhanshu Singh, Gyana Parija, Ramasuri
Narayanam |
| Interpretable Automated Machine Learning in Maana Knowledge Platform |
Fangkai Yang, Alexander Elkholy, Steven Gustafson |
| Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork:
The STAR Framework |
Shani Alkoby, Avilash Rath, Peter Stone |
| Power indices for team reformation planning under uncertainty |
Jonathan Cohen, Abdel-Illah Mouaddib |
| The StarCraft Multi-Agent Challenge |
Mikayel Samvelyan, Tabish Rashid, Gregory Farquhar, Jakob Foerster,
Christian Schroeder de Witt, Nantas Nardelli, Tim G. J. Rudner, Chia-Man
Hung, Philip H. S. Torr, Shimon Whiteson |
| Generative Adversarial Imitation from Observation |
Faraz Torabi, Garrett Warnell, Peter Stone |
| Verifying Strategic Abilities in Multi-agent Systems with Private
Data-Sharing |
Francesco Belardinelli, Ioana Boureanu, Catalin Dima, Vadim Malvone |
| Curriculum Learning for Tightly Coupled Multiagent Systems |
Golden Rockefeller, Patrick Mannion, Kagan Tumer |
| A Compression-Inspired Framework for Macro Discovery |
Francisco Garcia, Bruno da Silva, Philip Thomas |
| When to stop for safe manipulation in unstructured environments? |
Abdullah Cihan Ak, Arda Inceoglu, Sanem Sariel |
| X*: Anytime Multiagent Planning With Bounded Search |
Kyle Vedder, Joydeep Biswas |
| What Stands-in for a Missing Tool?: A Prototypical Grounded
Knowledge-based Approach to Tool Substitution |
Madhura Thosar, Christian Mueller, Georg Jäger, Till Mossakowski,
Sebastian Zug |
| Training Cooperative Agents for Multi-Agent Reinforcement Learning |
Sushrut Bhalla, Sriram Ganapathi Subramanian, Mark Crowley |
| Long-term Autonomous Mobile Manipulation under Uncertainty |
Michael Lanighan, Roderic Grupen |
| Agent Software is More Complex than Other Software: An Empirical
Investigation |
Alon Zanbar, Gal Kaminka |
| A Property-based Testing Framework for Multi-Agent Systems |
Lars-Åke Fredlund, Clara Benac Earle |
| Removing the Target Network from Deep Q-Networks with Mellowmax Operator |
Seungchan Kim, Kavosh Asadi, Michael Littman, George Konidaris |
| Modeling Human Decision-Making during Hurricanes: From Model to Data
Collection to Prediction |
Nutchanon Yongsatianchot, Stacy Marsella |
| Social Power in Human-Robot Interaction: Towards more Persuasive Robots |
Mojgan Hashemian, Ana Paiva, Samuel Mascarenhas, Pedro Santos, Rui Prada |
| Applying Norms and Sanctions to Promote Cybersecurity Hygiene |
Shubham Goyal, Nirav Ajmeri, Munindar Singh |
| Learn a Robust Policy in Adversarial Games via Playing with an Expert
Opponent |
Jialian Li, Tongzheng Ren, Hang Su, Jun Zhu |
| Smart Targets to Avoid Observation in CTO problem |
Leonardo Ferreira da Costa, Thayanne Franca da Silva, Jose Luis Alves
Leite, Raimundo Juracy Campos Ferro Junior, Raphael Pinheiro de Souza, Joao
Pedro Bernardino Andrade, Gustavo Augusto Lima de Campos |
| The unbroken telephone game: keeping swarms connected |
Vivek Shankar Varadharajan, Bram Adams, Giovanni Beltrame |
| Optimal Risk in Multiagent Blind Tournaments |
Theodore Perkins |
| To be Big Picture Thinker or Detail-Oriented? Utilizing Perceived Gist
Information to Achieve Efficient Convention Emergence with Bilateralism and
Multilateralism |
Shuyue Hu, Chin-wing Leung, Ho-fung Leung, Jiamou Liu |
| The DARPA SocialSim Challenge: Massive Multi-Agent Simulations of the
Github Ecosystem |
Jim Blythe, Emilio Ferrara, Diana Huang, Kristina Lerman, Goran Muric,
Anna Sapienza, Alexey Tregubov, Diogo Pacheco, John Bollenbacher, Alessandro
Flammini, Pik-Mai Hui, Fil Menczer |
| Active Learning with Gaussian Processes for High Throughput Phenotyping |
Sumit Kumar, Wenhao Luo, George Kantor, Katia Sycara |
| Escape Room: A Configurable Testbed for Hierarchical Reinforcement
Learning |
Jacob Menashe, Peter Stone |
| Bribery in Balanced Knockout Tournaments |
Christine Konicki, Virginia Vassilevska Williams |
| Cooperative Multi-Agent Deep Reinforcement Learning in Soccer Domains |
Jim Martin Catacora Ocana, Francesco Riccio, Roberto Capobianco, Daniele
Nardi |
| Explicable Planning as Minimizing Distance from Expected Behavior |
Anagha Kulkarni, Yantian Zha, Tathagata Chakraborti, Satya Gautam
Vadlamudi, Yu Zhang, Subbarao Kambhampati |