| Approximation Gradient Error
Variance Reduced Optimization |
Weiye Zhao |
| Credulous Acceptability,
Poison Games and Modal Logic |
Davide Grossi, Simon Rey |
| Learning Efficient
Communication in Cooperative Multi-Agent Environment |
Yuhang Zhao, Xiujun Ma |
| Optimal Bribery in Voting |
Palash Dey |
| Coordinating Sacrifices to
Enhance Social Welfare in Multi-agent Systems |
Han Yu, Zhiqi Shen, Lizhen Cui, Yongqing Zheng, Victor
Lesser |
| Social Mobilization to
Reposition Indiscriminately Parked Shareable Bikes |
Zelei Liu, Han Yu, Leye Wang, Liang Hu, Qiang Yang |
| A Regulation Enforcement
Solution for Multi-agent Reinforcement Learning |
Sun Fan-Yun, Yen-Yu Chang, Yueh-Hua Wu, Shou-De Lin |
| Bayes-ToMoP: A Fast Detection
and Best Response Algorithm Towards Sophisticated Opponents |
Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan
Zheng |
| Multi-agent Path Planning
with Non-constant Velocity Motion |
Ngai Meng Kou, Cheng Peng, Xiaowei Yan, Zhiyuan Yang, Heng
Liu, Kai Zhou, Haibing Zhao, Lijun Zhu, Yinghui Xu |
| Installing Resilience in
Distributed Constraint Optimization Operated by Physical Multi-Agent Systems |
Pierre Rust, Gauthier Picard, Fano Ramparany |
| Student-Project-Resource
Matching-Allocation Problems: Two-Sided Matching Meets Resource Allocation |
Anisse Ismaili, Kentaro Yahiro, Makoto Yokoo, Tomoaki
Yamaguchi |
| Complexity and Approximations
in Robust Coalition Formation via Max-Min k-Partitioning |
Anisse Ismaili, Makoto Yokoo, Noam Hazon, Sarit Kraus, Emi
Watanabe |
| Contradict The Machine: a
Hybrid Approach to Identifying Unknown Unknowns |
Colin Vandenhof, Edith Law |
| Invincible Strategies of
Iterated Prisoner's Dilemma |
Shiheng Wang, Fangzhen Lin |
| An Urgency-Dependent Quorum
Sensing Algorithm for N-Site Selection in Autonomous Swarms |
Grace Cai, Don Sofge |
| General-Sum Cyber Deception
Games under Partial Attacker Valuation Information |
Omkar Thakoor, Phebe Vayanos, Christopher Kiekintveld, Milind
Tambe, Haifeng Xu |
| The Representational Capacity
of Action-Value Networks for Multi-Agent Reinforcement Learning |
Jacopo Castellini, Frans Oliehoek, Rahul Savani, Shimon
Whiteson |
| Simple Contrapositive
Assumption-Based Frameworks |
Ofer Arieli, Jesse Heyninck |
| Optimising Worlds to Evaluate
and Influence Reinforcement Learning Agents |
Richard Everett, Adam Cobb, Stephen Roberts, Andrew
Markham |
| Broken Signals in Security
Games: Coordinating Patrollers and Sensors in the Real World |
Elizabeth Bondi, Hoon Oh, Haifeng Xu, Fei Fang, Bistra
Dilkina, Milind Tambe |
| Probabilistic
resource-bounded alternating-time temporal logic |
Hoang Nga Nguyen, Abdur Rakib |
| Fair Division of Indivisible
Goods Among Strategic Agents |
Siddharth Barman, Ganesh Ghalme, Shivika Narang, Shweta Jain,
Pooja Kulkarni |
| A Polynomial-time Fragment of
Epistemic Probabilistic Argumentation |
Nico Potyka |
| Polynomial-Time Multi-Agent
Pathfinding with Heterogeneous and Self-Interested Agents |
Manao Machida |
| Facility Location for Three
Agents |
Reshef Meir |
| Distributed Policy Iteration
for Scalable Approximation of Cooperative Multi-Agent Policies |
Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor,
Sebastian Feld, Claudia Linnhoff-Popien |
| Learning Factored Markov
Decision Processes with Unawareness |
Craig Innes, Alex Lascarides |
| Reachability and Coverage
Planning for Connected Agents |
Tristan Charrier, Arthur Queffelec, Ocan Sankur, Francois
Schwarzentruber |
| The Complexity of the
Possible Winner Problem with Partitioned Preferences |
Batya Kenig |
| Avoiding Social
Disappointment in Elections |
Mohammad Ali Javidian, Pooyan Jamshidi, Rasoul Ramezanian |
| A Q-values Sharing Framework
for Multiple Independent Q-learners |
Changxi Zhu, Ho-fung Leung, Shuyue Hu, Yi Cai |
| Multiagent Adversarial
Inverse Reinforcement Learning |
Ermo Wei, Drew Wicke, Sean Luke |
| Landmark Based Reward Shaping
in Reinforcement Learning with Hidden States |
Alper Demir, Erkin Çilden, Faruk Polat |
| Personality-Based
Representations of Imperfect-Recall Games |
Andrea Celli, Nicola Gatti, Giulia Romano |
| Generating Voting Rules from
Random Relations |
Nic Wilson |
| Multi-Agent Hierarchical
Reinforcement Learning with Dynamic Termination |
Dongge Han, Wendelin Boehmer, Michael Wooldridge, Alex
Rogers |
| Dynamic Trip-Vehicle Dispatch
with Scheduled and On-Demand Requests |
Taoan Huang, Bohui Fang, Hoon Oh, Xiaohui Bei, Fei Fang |
| Cooperating in Long-term
Relationships with Time-Varying Structure |
Jacob Crandall, Huy Pham |
| Regular Decision Processes: Modelling Dynamic Systems without Using Hidden Variables |
Ronen Brafman, Giuseppe De Giacomo |
| On Enactability of Agent
Interaction Protocols: Towards a Unified Approach |
Angelo Ferrando, Michael Winikoff, Stephen Cranefield, Viviana
Mascardi, Frank Dignum |
| MARL-PPS: Multi-agent
Reinforcement Learning with Periodic Parameter Sharing |
Safa Cicek, Alireza Nakhaei, Stefano Soatto, Kikuo
Fujimura |
| A New Constraint Satisfaction
Perspective on Multi-Agent Path Finding |
Jiangxing Wang, Jiaoyang Li, Hang Ma, Sven Koenig, T. K.
Satish Kumar |
| Entailment Functions and
Reasoning Under Inconsistency |
Yakoub Salhi |
| Vote for Me! Election Control
via Social Influence in Arbitrary Scoring Rule Voting Systems |
Federico Corò, Emilio Cruciani, Gianlorenzo D'Angelo, Stefano
Ponziani |
| Coordinated Multiagent
Reinforcement Learning for Teams of Mobile Sensing Robots |
Chao Yu, Xin Wang, Zhanbo Feng |
| Learning through Probing: a
decentralized reinforcement learning architecture for social dilemmas |
Nicolas Anastassacos, Mirco Musolesi |
| MCTS-based Automated
Negotiation Agent |
Cédric Buron, Zahia Guessoum, Sylvain Ductor |
| Towards a “Master Algorithm”
for Forming Faster Conventions On Various Networks |
Mohammad Hasan |
| The Complexity of Additive
Committee Selection with Outliers |
Yongjie Yang, Jianxin Wang |
| Maximin-Aware Allocations of Indivisible Goods |
Hau Chan, Jing Chen, Bo Li, Xiaowei Wu |
| Advice Replay Approach for
Richer Knowledge Transfer in Teacher Student Framework |
Vaibhav Gupta, Daksh Anand, Praveen Paruchuri, Balaraman
Ravindran |
| Proportional Representation
in Elections: STV vs PAV |
Piotr Faliszewski, Piotr Skowron, Stanislaw Szufa, Nimrod
Talmon |
| Simple Contest Enhancers |
Michal Habani, Priel Levy, David Sarne |
| Temporal Information Design
in Contests |
Priel Levy, David Sarne, Yonatan Aumann |
| Policy Networks: A Framework
for Scalable Integration of Multiple Decision-Making Models |
Kyle Wray, Shlomo Zilberstein |
| Learning Self-Game-Play
Agents for Combinatorial Optimization Problems |
Ruiyang Xu, Karl Lieberherr |
| Multiagent Monte Carlo Tree
Search |
Nicholas Zerbel, Logan Yliniemi |
| Using surrogate models to
calibrate agent-based model parameters under data scarcity |
Priscilla Avegliano, Jaime Sichman |
| Learning Simulation-Based
Games from Data |
Enrique Areyan Viqueira, Cyrus Cousins, Amy Greenwald, Eli
Upfal |
| Maxmin Share Fair Allocation
of Indivisible Chores to Asymmetric Agents |
Haris Aziz, Hau Chan, Bo Li |
| Modeling Random Guessing and
Task Difficulty for Truth Discovery in Crowdsourcing |
Yi Yang, Quan Bai, Qing Liu |
| Attention-based Deep
Reinforcement Learning for Multi-view Environments |
Elaheh Barati, Xuewen Chen |
| Generating an Agent Taxonomy
Using Topological Data Analysis |
Samarth Swarup, Reza Rezazadegan |
| Warning Time: Optimizing
Strategic Signaling for Security Against Boundedly Rational Adversaries |
Sarah Cooney, Phebe Vayanos, Thanh Nguyen, Cleotilde Gonzalez,
Christian Lebiere, Edward Cranford, Milind Tambe |
| Optimal Sequential Planning
for Communicative Actions: A Bayesian Approach |
Piotr Gmytrasiewicz, Sarit Adhikari |
| Coordination Structures
Generated by Deep Reinforcement Learning in Distributed Task Executions |
Yuki Miyashita, Toshiharu Sugawara |
| Memory Based Multiagent One
Shot Learning |
Shauharda Khadka, Connor Yates, Kagan Tumer |
| Robustness against Agent
Failure in Hedonic Games |
Ayumi Igarashi, Kazunori Ota, Yuko Sakurai, Makoto Yokoo |
| Obvious Strategyproofness,
Bounded Rationality and Approximation |
Diodato Ferraioli, Carmine Ventre |
| An Optimal Rewiring Strategy
for Cooperative Multiagent Social Learning |
Hongyao Tang, Jianye Hao, Li Wang, Zan Wang, Tim Baarslag |
| A dynamic aleatoric calculus
for reasoning in games of bluffing and chance |
Tim French, Andrew Gozzard, Mark Reynolds |
| A Truthful,
Privacy-Preserving, Approximately Efficient Combinatorial Auction For
Single-minded Bidders |
Sankarshan Damle, Boi Faltings, Sujit Gujar |
| Cooperative Routing with
Heterogeneous Vehicles |
Keisuke Otaki, Satoshi Koide, Ayano Okoso, Tomoki Nishi |
| On the maximization of
influence over an unknown social network |
Kexiu Song, Jiamou Liu, Bo Yan, Yiping Liu, Hongyi Su, Hong
Zheng |
| How to get the most from
goods donated to charities |
Christopher Culley, Ji Qi, Carmine Ventre |
| Actor-Critic Algorithms for
Constrained Multi-agent Reinforcement Learning |
Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda,
Prabuchandran Krithivasan Jayachandran, Shalabh Bhatnagar |
| Meta-Strategy for Multi-Time
Negotiation: A Multi-Armed Bandit Approach |
Ryohei Kawata, Katsuhide Fujita |
| Stackelberg Equilibrium
approximation in general-sum extensive-form games with double-oracle sampling
method |
Jan Karwowski, Jacek Mańdziuk |
| Thompson Sampling Based
Multi-Armed-Bandit Mechanism Using Neural Networks |
Manisha Padala, Sujit Gujar |
| Computing Stable Solutions in
Threshold Network Flow Games With Bounded Treewidth |
Aldo Pacchiano, Yoram Bachrach |
| Hybrid BiLSTM-Siamese Network
for Relation Extraction |
Zeyuan Cui, Shijun Liu |
| Efficient City-Scale
Patrolling Using Decomposition and Grafting |
Wanyuan Wang, Zichen Dong, Bo An, Yichuan Jiang |
| Risk Averse Reinforcement
Learning for Mixed Multi-agent Environments |
Sai Koti Reddy Danda, Amrita Saha, Srikanth Tamilselvam,
Priyanka Agrawal, Pankaj Dayama |
| Evidence Propagation and
Consensus Formation in Noisy Environments |
Michael Crosscombe, Jonathan Lawry |
| Emergence of
Scenario-Appropriate Collaborative Behaviors for Teams of Robotic Bodyguards |
Hassam Sheikh, Ladislau Bölöni |
| The Imitation Game: Learned
reciprocity in Markov games |
Tom Eccles, Edward Hughes, Steven Wheelwright, Joel Leibo,
János Kramár |
| From Hotelling to Load
Balancing: Approximation and the Principle of Minimum Differentiation |
Matthias Feldotto, Pascal Lenzner, Louise Molitor, Alexander
Skopalik |
| Online Motion Concept
Learning: A novel algorithm for sample-efficient learning and recognition of
human actions |
Miguel Vasco, Francisco Melo, David Martins de Matos, Ana
Paiva, Tetsunari Inamura |
| Automatic Feature Engineering
by Deep Reinforcement Learning |
Jianyu Zhang, Jianye Hao, Françoise Fogelman-Soulié, Zan
Wang |
| Rethinking the Neutrality
Axiom in Judgment Aggregation |
Zoi Terzopoulou, Ulle Endriss |
| Explaining Failures
Propagations in the Execution of Multi-Agent Temporal Plans |
Gianluca Torta, Roberto Micalizio, Samuele Sormano |
| Logically-Constrained Neural
Fitted Q-iteration |
Mohammadhosein Hasanbeig, Alessandro Abate, Daniel
Kroening |
| A Homophily-Free Community
Detection Framework for Trajectories with Delayed Responses |
Chung-Kyun Han, Shih-Fen Cheng, Pradeep Varakantham |
| Stability of Human-Inspired
Agent Societies |
Joe Collenette, Katie Atkinson, Daan Bloembergen, Karl
Tuyls |
| Deep Generative and
Discriminative Domain Adaptation |
Han Zhao |
| Exploration in the face of
Parametric and Intrinsic Uncertainties |
Borislav Mavrin, Shangtong Zhang, Hengshuai Yao, Linglong
Kong |
| Predictive Execution
Monitoring of BDI Recipes |
Mika Barkan, Gal Kaminka |
| Priority driven Local
Optimization for Crowd Simulation |
Himangshu Saikia, Fangkai Yang, Christopher Peters |
| Aggregating Citizen
Preferences for Public Projects Through Civic Crowdfunding |
Sankarshan Damle, Moin Hussain Moti, Praphul Chandra, Sujit
Gujar |
| Adaptive multi-agent system
for situated task allocation |
Quentin Baert, Anne-Cécile Caron, Maxime Morge,
Jean-Christophe Routier |
| The Gift Exchange Game: Managing Opponent Actions |
Steven Damer, Maria Gini, Jeffrey Rosenschein |
| DeepAggregation: A New
Approach for Aggregating Incomplete Ranked Lists using Multi-Layer Graph
Embedding |
Rohith D Vallam, Ramasuri Narayanam, Srikanth Tamilselvam,
Nicholas Mattei, Sudhanshu Singh, Shweta Garg, Gyana Parija |
| A Social Choice Perspective
on Database Aggregation |
Francesco Belardinelli, Umberto Grandi |
| A Privacy Preserving
Multiagent System for Load Balancing in the Smart Grid |
Shangyu Xie, Yuan Hong, Peng-Jun Wan |
| Collaborative Reinforcement
Learning Model for Sustainability of Cooperation in Sequential Social
Dilemmas |
Ritwik Chaudhuri, Kushal Mukherjee, Rohith D Vallam, Ayush
Kumar, Antriksh Mathur, Shweta Garg, Sudhanshu Singh, Gyana Parija, Ramasuri
Narayanam |
| A Truthful Online Mechanism
for Allocating Fog Computing Resources |
Fan Bi, Sebastian Stein, Enrico Gerding, Nick Jennings, Tom La
Porta |
| Reinforcement Learning with
Derivative-Free Exploration |
Xionghui Chen, Yang Yu |
| Strategic Majoritarian Voting
with Propositional Goals |
Arianna Novaro, Umberto Grandi, Dominique Longin, Emiliano
Lorini |
| Teaching Social Behavior
through Human Reinforcement for Ad hoc Teamwork: The STAR Framework |
Shani Alkoby, Avilash Rath, Peter Stone |
| Classification of Contractual
Conflicts via Learning of Semantic Representations |
João Paulo Aires, Roger Granada, Juarez Monteiro, Rodrigo
Coelho Barros, Felipe Meneguzzi |
| Deep Fictitious Play for
Games with Continuous Action Spaces |
Nitin Kamra, Umang Gupta, Kai Wang, Fei Fang, Yan Liu, Milind
Tambe |
| Power indices for team
reformation planning under uncertainty |
Jonathan Cohen, Abdel-Illah Mouaddib |
| Verifying Strategic Abilities
in Multi-agent Systems with Private Data-Sharing |
Francesco Belardinelli, Ioana Boureanu, Catalin Dima, Vadim
Malvone |
| Masquerade Attack Detection
Through Observation Planning for Multi-Robot Systems |
Kacper Wardega, Wenchao Li, Roberto Tron |
| Meta-learning of Bidding
Agent with Knowledge Gradient in a Fully Agent-based Sponsored Search Auction
Simulator |
Donghun Lee, Warren Powell |
| Curriculum Learning for
Tightly Coupled Multiagent Systems |
Golden Rockefeller, Patrick Mannion, Kagan Tumer |
| A Compression-Inspired
Framework for Macro Discovery |
Francisco Garcia, Bruno da Silva, Philip Thomas |
| A Meta-MDP Approach to
Exploration for Lifelong Reinforcement Learning |
Francisco Garcia, Philip Thomas |
| X*: Anytime Multiagent
Planning With Bounded Search |
Kyle Vedder, Joydeep Biswas |
| Report-Sensitive
Spot-checking in Peer Grading |
Hedayat Zarkoob, Kevin Leyton-Brown, Hu Fu |
| Training Cooperative Agents
for Multi-Agent Reinforcement Learning |
Sushrut Bhalla, Sriram Ganapathi Subramanian, Mark
Crowley |
| Toward Robust Policy
Summarization |
Isaac Lage, Daphna Lifschitz, Finale Doshi-Velez, Ofra
Amir |
| Manipulative Design of
Scoring Systems |
Dorothea Baumeister, Tobias Hogrebe |
| Removing the Target Network
from Deep Q-Networks with Mellowmax Operator |
Seungchan Kim, Kavosh Asadi, Michael Littman, George
Konidaris |
| Modeling Human
Decision-Making during Hurricanes: From Model to Data Collection to
Prediction |
Nutchanon Yongsatianchot, Stacy Marsella |
| Preference Learning in
Automated Negotiation Using Gaussian Uncertainty Models |
Haralambie Leahu, Michael Kaisers, Tim Baarslag |
| Social Power in Human-Robot
Interaction: Towards more Persuasive Robots |
Mojgan Hashemian, Ana Paiva, Samuel Mascarenhas, Pedro Santos,
Rui Prada |
| Designing Emergent Swarm
Behaviors using Behavior Trees and Grammatical Evolution |
Aadesh Neupane, Michael A. Goodrich |
| Multi-Agent Learning and
Coordination with Clustered Deep Q-Network |
Simon Pageaud, Véronique Deslandres, Vassilissa Lehoux, Salima
Hassas |
| Applying Norms and Sanctions
to Promote Cybersecurity Hygiene |
Shubham Goyal, Nirav Ajmeri, Munindar Singh |
| Robust Monitoring on Graphs
with an Application to Suicide Prevention in Social Networks |
Aida Rahmattalabi, Phebe Vayanos, Anthony Fulginiti, Milind
Tambe |
| Learn a Robust Policy in
Adversarial Games via Playing with an Expert Opponent |
Jialian Li, Tongzheng Ren, Hang Su, Jun Zhu |
| Smart Targets to Avoid
Observation in CTO problem |
Leonardo Ferreira da Costa, Thayanne França da Silva, José
Luis Alves Leite, Raimundo Juracy Campos Ferro Junior, Raphael Pinheiro de
Souza, João Pedro Bernardino Andrade, Gustavo Augusto Lima de Campos |
| The Rise and Fall of Complex
Family Structures: Coalition Formation, Stability, and Power Struggle |
Angelina Brilliantova, Anton Pletenev, Hadi Hosseini |
| Optimal Risk in Multiagent
Blind Tournaments |
Theodore Perkins |
| To be Big Picture Thinker or
Detail-Oriented? Utilizing Perceived Gist Information to Achieve Efficient
Convention Emergence with Bilateralism and Multilateralism |
Shuyue Hu, Chin-wing Leung, Ho-fung Leung, Jiamou Liu |
| The DARPA SocialSim
Challenge: Massive Multi-Agent Simulations of the Github Ecosystem |
Jim Blythe, Emilio Ferrara, Diana Huang, Kristina Lerman,
Goran Muric, Anna Sapienza, Alexey Tregubov, Diogo Pacheco, John
Bollenbacher, Alessandro Flammini, Pik-Mai Hui, Fil Menczer |
| Object Exchangability in
Reinforcement Learning |
John Mern, Dorsa Sadigh, Mykel Kochenderfer |
| Escape Room: A Configurable
Testbed for Hierarchical Reinforcement Learning |
Jacob Menashe, Peter Stone |
| A Selective Exploration
Method for Policy Transfer in Reinforcement Learning |
Akshay Narayan, Tze Yun Leong |
| Two-stage n-person prisoner's
dilemma with social preferences |
Seji Takanashi, Makoto Yokoo |
| Bribery in Balanced Knockout
Tournaments |
Christine Konicki, Virginia Vassilevska Williams |
| Fairness Through the Lens of
Proportional Equality |
Arpita Biswas, Suvam Mukherjee |
| Recognising and Explaining
Bidding Strategies in Negotiation Support Systems |
Vincent Koeman, Koen Hindriks, Jonathan Gratch, Catholijn
Jonker |
| Domain Adaptation for
Reinforcement Learning on the Atari |
Thomas Carr, Maria Chli, George Vogiatzis |