HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Ryan Lowe; Yi Wu; Aviv Tamar; Jean Harb; Pieter Abbeel; Igor Mordatch

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Abstract

We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment, while policy gradient suffers from a variance that increases as the number of agents grows. We then present an adaptation of actor-critic methods that considers action policies of other agents and is able to successfully learn policies that require complex multi-agent coordination. Additionally, we introduce a training regimen utilizing an ensemble of policies for each agent that leads to more robust multi-agent policies. We show the strength of our approach compared to existing methods in cooperative as well as competitive scenarios, where agent populations are able to discover various physical and informational coordination strategies.

Code Repositories

darshil333/CSE574
Mentioned in GitHub
qi-pang/mdpfuzz
Mentioned in GitHub
bonniesjli/MADDPG_Tennis
pytorch
Mentioned in GitHub
jyqhahah/rl_maddpg_matd3
pytorch
Mentioned in GitHub
NeuroCSUT/intentions
tf
Mentioned in GitHub
bonniesjli/MADDPG_Tennis_UnityML
pytorch
Mentioned in GitHub
krasing/DRLearningCollaboration
pytorch
Mentioned in GitHub
rainandwind1/MERL
pytorch
Mentioned in GitHub
shariqiqbal2810/maddpg-pytorch
pytorch
Mentioned in GitHub
openai/maddpg
tf
Mentioned in GitHub
goldbattle/snakes_mal
tf
Mentioned in GitHub
AleXander-Tsui/MPE
Mentioned in GitHub
jingdic/rgmcomm
pytorch
Mentioned in GitHub
zowiezhang/stas
pytorch
Mentioned in GitHub
openai/multiagent-particle-envs
Official
Mentioned in GitHub
RL-WFU/multi_agent_attack
tf
Mentioned in GitHub
Yutongamber/MADDPG
pytorch
Mentioned in GitHub
zoeyuchao/MPE-pytorch
pytorch
Mentioned in GitHub
biemann/Collaboration-and-Competition
pytorch
Mentioned in GitHub
baicenxiao/shaping-advice
tf
Mentioned in GitHub
marwanihab/RL_Tag_Game
pytorch
Mentioned in GitHub
kargarisaac/macrpo
pytorch
Mentioned in GitHub
mauricemager/multiagent-robot
tf
Mentioned in GitHub
zoeyuchao/MPEnew-pytorch
pytorch
Mentioned in GitHub
JinTanda/MADDPG_env
Mentioned in GitHub
cyanrain7/trpo-in-marl
pytorch
Mentioned in GitHub
baradist/multiagent-particle-envs
pytorch
Mentioned in GitHub
xuehy/pytorch-maddpg
tf
Mentioned in GitHub
baoqianwang/iros22_darl1n
tf
Mentioned in GitHub
caslab-vt/SARNet
tf
Mentioned in GitHub
google/maddpg-replication
tf
Mentioned in GitHub
JohannesAck/MATD3implementation
tf
Mentioned in GitHub
isp1tze/MAProj
pytorch
Mentioned in GitHub
Stippler/cow-simulator
pytorch
Mentioned in GitHub
ksajan/DDPG-MAPE
tf
Mentioned in GitHub
debajit15kgp/multiagent-envs
Mentioned in GitHub
jansenkeith501/CS295-MADDPG
Mentioned in GitHub
Chan1998/MAAC
pytorch
Mentioned in GitHub
rainandwind1/MADDPG-reconstruct
pytorch
Mentioned in GitHub
morning9393/HAPPO-HATRPO
pytorch
Mentioned in GitHub
starry-sky6688/MADDPG
pytorch
Mentioned in GitHub
quantumiracle/mars
pytorch
Mentioned in GitHub
JohannesAck/tf2multiagentrl
tf
Mentioned in GitHub
rainandwind1/Maddpg_multiagent
pytorch
Mentioned in GitHub
raoshashank/Tennis-with-MADDPG
pytorch
Mentioned in GitHub
pr-shukla/maddpg-keras
tf
Mentioned in GitHub
biorobotics/PRD_environments
Mentioned in GitHub
madhur-tandon/RL-Project
pytorch
Mentioned in GitHub
facebookresearch/benchmarl
pytorch
Mentioned in GitHub
Ah31/maddpg_pytorch
pytorch
Mentioned in GitHub
thechrisyoon08/marl
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
smac-on-smac-def-armored-sequentialMADDPG
Median Win Rate: 90.6
smac-on-smac-def-infantry-sequentialMADDPG
Median Win Rate: 100
smac-on-smac-def-outnumbered-sequentialMADDPG
Median Win Rate: 81.3
smac-on-smac-off-complicated-sequentialMADDPG
Median Win Rate: 0.0
smac-on-smac-off-distant-sequentialMADDPG
Median Win Rate: 0.0
smac-on-smac-off-hard-sequentialMADDPG
Median Win Rate: 0.0
smac-on-smac-off-near-sequentialMADDPG
Median Win Rate: 75.0
smac-on-smac-off-superhard-sequentialMADDPG
Median Win Rate: 0.0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp