HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Dueling Network Architectures for Deep Reinforcement Learning

Ziyu Wang; Tom Schaul; Matteo Hessel; Hado van Hasselt; Marc Lanctot; Nando de Freitas

Dueling Network Architectures for Deep Reinforcement Learning

Abstract

In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this paper, we present a new neural network architecture for model-free reinforcement learning. Our dueling network represents two separate estimators: one for the state value function and one for the state-dependent action advantage function. The main benefit of this factoring is to generalize learning across actions without imposing any change to the underlying reinforcement learning algorithm. Our results show that this architecture leads to better policy evaluation in the presence of many similar-valued actions. Moreover, the dueling architecture enables our RL agent to outperform the state-of-the-art on the Atari 2600 domain.

Code Repositories

wtingda/DeepRLBreakout
tf
Mentioned in GitHub
facebookresearch/Horizon
pytorch
Mentioned in GitHub
nbopardi/smb
tf
Mentioned in GitHub
shehrum/RL_Navigation
pytorch
Mentioned in GitHub
alessandrositta/Flatland_challenge
pytorch
Mentioned in GitHub
R-Sweke/DeepQ-Decoding
tf
Mentioned in GitHub
gouxiangchen/dueling-DQN-pytorch
pytorch
Mentioned in GitHub
dxyang/DQN_pytorch
pytorch
Mentioned in GitHub
utarumo/RL_implementation
tf
Mentioned in GitHub
JuliaPOMDP/DeepQLearning.jl
Mentioned in GitHub
cove9988/TradingGym
Mentioned in GitHub
iDataist/Navigation-with-Deep-Q-Network
pytorch
Mentioned in GitHub
la3lma/chezjulia
tf
Mentioned in GitHub
guillaumeboniface/bananaland
pytorch
Mentioned in GitHub
chainer/chainerrl
pytorch
Mentioned in GitHub
botforge/simplementation
pytorch
Mentioned in GitHub
jsztompka/DuelDQN
pytorch
Mentioned in GitHub
near32/regym
pytorch
Mentioned in GitHub
1jsingh/rl_navigation
pytorch
Mentioned in GitHub
BY571/DQN-Atari-Agents
pytorch
Mentioned in GitHub
jezzarax/drlnd_p1_navigation
pytorch
Mentioned in GitHub
eddynelson/dqn
tf
Mentioned in GitHub
tensorlayer/RLzoo
tf
Mentioned in GitHub
fengsterooni/dql
pytorch
Mentioned in GitHub
rybread1/deep-rl-trex
tf
Mentioned in GitHub
ZainRaza14/deepRL
pytorch
Mentioned in GitHub
la3lma/Chez
tf
Mentioned in GitHub
Adrelf/DRL-navigation
pytorch
Mentioned in GitHub
MEOWMEOW114/nd893-p1-navigation-banana
pytorch
Mentioned in GitHub
opplieam/Pong-Deep-RL
pytorch
Mentioned in GitHub
mindspore-courses/Rainbow-MindSpore
mindspore
Mentioned in GitHub
abryeemessi/Wednesday
Mentioned in GitHub
kmdanielduan/DQN_Family_PyTorch
pytorch
Mentioned in GitHub
JBGUIMBAUD/deep-reenforcement-learning
pytorch
Mentioned in GitHub
xusophia/DataSciFinalProj
pytorch
Mentioned in GitHub
rybread1/DeepRlTrex
tf
Mentioned in GitHub
ethanmclark1/carla_aebs
pytorch
Mentioned in GitHub
KDL-umass/saliency_maps
Mentioned in GitHub
nathanin/pad
tf
Mentioned in GitHub
mohit8935/Deep-Q-Learning-Paper
pytorch
Mentioned in GitHub
HussonnoisMaxence/RL_Algorithms
pytorch
Mentioned in GitHub
NervanaSystems/coach
tf
Mentioned in GitHub
facebookresearch/ReAgent
pytorch
Mentioned in GitHub
mightypirate1/DRL-Tetris
tf
Mentioned in GitHub
austinsilveria/Banana-Collection-DQN
pytorch
Mentioned in GitHub
shashwatsaxena571/DRL-navigation
pytorch
Mentioned in GitHub
marload/DeepRL-TensorFlow2
tf
Mentioned in GitHub
philtabor/Deep-Q-Learning-Paper-To-Code
pytorch
Mentioned in GitHub
Brandon-Rozek/DeepRL
Mentioned in GitHub
FaboNo/DRLND
pytorch
Mentioned in GitHub
SayhoKim/tetrisRL
tf
Mentioned in GitHub
clarky104/carla_aebs
pytorch
Mentioned in GitHub
prajwalgatti/DRL-Navigation
Mentioned in GitHub
ku2482/sac-discrete.pytorch
pytorch
Mentioned in GitHub
MOVzeroOne/DQN
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
atari-games-on-atari-2600-alienPrior+Duel noop
Score: 3941.0
atari-games-on-atari-2600-alienDuel noop
Score: 4461.4
atari-games-on-atari-2600-alienDDQN (tuned) noop
Score: 3747.7
atari-games-on-atari-2600-alienPrior+Duel hs
Score: 823.7
atari-games-on-atari-2600-alienDuel hs
Score: 1486.5
atari-games-on-atari-2600-amidarPrior+Duel hs
Score: 238.4
atari-games-on-atari-2600-amidarDDQN (tuned) noop
Score: 1793.3
atari-games-on-atari-2600-amidarDuel hs
Score: 172.7
atari-games-on-atari-2600-amidarDuel noop
Score: 2354.5
atari-games-on-atari-2600-amidarPrior+Duel noop
Score: 2296.8
atari-games-on-atari-2600-assaultPrior+Duel noop
Score: 11477.0
atari-games-on-atari-2600-assaultPrior+Duel hs
Score: 10950.6
atari-games-on-atari-2600-assaultDDQN (tuned) noop
Score: 5393.2
atari-games-on-atari-2600-assaultDuel noop
Score: 4621.0
atari-games-on-atari-2600-assaultDuel hs
Score: 3994.8
atari-games-on-atari-2600-asterixPrior+Duel hs
Score: 364200.0
atari-games-on-atari-2600-asterixDDQN (tuned) noop
Score: 17356.5
atari-games-on-atari-2600-asterixPrior+Duel noop
Score: 375080.0
atari-games-on-atari-2600-asterixDuel hs
Score: 15840.0
atari-games-on-atari-2600-asterixDuel noop
Score: 28188.0
atari-games-on-atari-2600-asteroidsPrior+Duel noop
Score: 1192.7
atari-games-on-atari-2600-asteroidsDuel noop
Score: 2837.7
atari-games-on-atari-2600-asteroidsDuel hs
Score: 2035.4
atari-games-on-atari-2600-asteroidsDDQN (tuned) noop
Score: 734.7
atari-games-on-atari-2600-atlantisDDQN (tuned) noop
Score: 106056.0
atari-games-on-atari-2600-atlantisDuel noop
Score: 382572.0
atari-games-on-atari-2600-atlantisDuel hs
Score: 445360.0
atari-games-on-atari-2600-atlantisPrior+Duel noop
Score: 395762.0
atari-games-on-atari-2600-bank-heistDDQN (tuned) noop
Score: 1030.6
atari-games-on-atari-2600-bank-heistDuel hs
Score: 1129.3
atari-games-on-atari-2600-bank-heistPrior+Duel noop
Score: 1503.1
atari-games-on-atari-2600-bank-heistDuel noop
Score: 1611.9
atari-games-on-atari-2600-battle-zoneDuel hs
Score: 31320.0
atari-games-on-atari-2600-battle-zoneDuel noop
Score: 37150.0
atari-games-on-atari-2600-battle-zonePrior+Duel noop
Score: 35520.0
atari-games-on-atari-2600-battle-zoneDDQN (tuned) noop
Score: 31700.0
atari-games-on-atari-2600-beam-riderDuel noop
Score: 12164.0
atari-games-on-atari-2600-beam-riderPrior+Duel noop
Score: 30276.5
atari-games-on-atari-2600-beam-riderDDQN (tuned) noop
Score: 13772.8
atari-games-on-atari-2600-beam-riderDuel hs
Score: 14591.3
atari-games-on-atari-2600-berzerkDuel noop
Score: 1472.6
atari-games-on-atari-2600-berzerkDuel hs
Score: 910.6
atari-games-on-atari-2600-berzerkDDQN (tuned) noop
Score: 1225.4
atari-games-on-atari-2600-berzerkPrior+Duel noop
Score: 3409.0
atari-games-on-atari-2600-bowlingDuel noop
Score: 65.5
atari-games-on-atari-2600-bowlingDuel hs
Score: 65.7
atari-games-on-atari-2600-bowlingDDQN (tuned) noop
Score: 68.1
atari-games-on-atari-2600-bowlingPrior+Duel noop
Score: 46.7
atari-games-on-atari-2600-boxingDDQN (tuned) noop
Score: 91.6
atari-games-on-atari-2600-boxingDuel noop
Score: 99.4
atari-games-on-atari-2600-boxingDuel hs
Score: 77.3
atari-games-on-atari-2600-boxingPrior+Duel noop
Score: 98.9
atari-games-on-atari-2600-breakoutPrior+Duel noop
Score: 366.0
atari-games-on-atari-2600-breakoutDuel hs
Score: 411.6
atari-games-on-atari-2600-breakoutDuel noop
Score: 345.3
atari-games-on-atari-2600-breakoutDDQN (tuned) noop
Score: 418.5
atari-games-on-atari-2600-centipedeDuel hs
Score: 4881.0
atari-games-on-atari-2600-centipedeDDQN (tuned) noop
Score: 5409.4
atari-games-on-atari-2600-centipedeDuel noop
Score: 7561.4
atari-games-on-atari-2600-centipedePrior+Duel noop
Score: 7687.5
atari-games-on-atari-2600-chopper-commandDuel noop
Score: 11215.0
atari-games-on-atari-2600-chopper-commandDDQN (tuned) noop
Score: 5809.0
atari-games-on-atari-2600-chopper-commandPrior+Duel noop
Score: 13185.0
atari-games-on-atari-2600-chopper-commandDuel hs
Score: 3784.0
atari-games-on-atari-2600-crazy-climberDuel noop
Score: 143570.0
atari-games-on-atari-2600-crazy-climberDDQN (tuned) noop
Score: 117282.0
atari-games-on-atari-2600-crazy-climberDuel hs
Score: 124566.0
atari-games-on-atari-2600-crazy-climberPrior+Duel noop
Score: 162224.0
atari-games-on-atari-2600-defenderPrior+Duel noop
Score: 41324.5
atari-games-on-atari-2600-defenderDuel noop
Score: 42214.0
atari-games-on-atari-2600-defenderPrior+Duel hs
Score: 34415.0
atari-games-on-atari-2600-demon-attackPrior+Duel noop
Score: 72878.6
atari-games-on-atari-2600-demon-attackDuel noop
Score: 60813.3
atari-games-on-atari-2600-demon-attackDuel hs
Score: 56322.8
atari-games-on-atari-2600-demon-attackDDQN (tuned) noop
Score: 58044.2
atari-games-on-atari-2600-double-dunkDuel noop
Score: 0.1
atari-games-on-atari-2600-double-dunkPrior+Duel noop
Score: -12.5
atari-games-on-atari-2600-double-dunkDDQN (tuned) noop
Score: -5.5
atari-games-on-atari-2600-double-dunkDuel hs
Score: -0.8
atari-games-on-atari-2600-enduroPrior+Duel noop
Score: 2306.4
atari-games-on-atari-2600-enduroDuel hs
Score: 2077.4
atari-games-on-atari-2600-enduroDuel noop
Score: 2258.2
atari-games-on-atari-2600-enduroDDQN (tuned) noop
Score: 1211.8
atari-games-on-atari-2600-fishing-derbyDuel hs
Score: -4.1
atari-games-on-atari-2600-fishing-derbyDuel noop
Score: 46.4
atari-games-on-atari-2600-fishing-derbyPrior+Duel noop
Score: 41.3
atari-games-on-atari-2600-fishing-derbyDDQN (tuned) noop
Score: 15.5
atari-games-on-atari-2600-freewayDuel noop
Score: 0.0
atari-games-on-atari-2600-freewayDuel hs
Score: 0.2
atari-games-on-atari-2600-freewayPrior+Duel noop
Score: 33.0
atari-games-on-atari-2600-freewayDDQN (tuned) noop
Score: 33.3
atari-games-on-atari-2600-frostbiteDuel hs
Score: 2332.4
atari-games-on-atari-2600-frostbitePrior+Duel noop
Score: 7413.0
atari-games-on-atari-2600-frostbiteDuel noop
Score: 4672.8
atari-games-on-atari-2600-frostbiteDDQN (tuned) noop
Score: 1683.3
atari-games-on-atari-2600-gopherPrior+Duel noop
Score: 104368.2
atari-games-on-atari-2600-gopherDuel noop
Score: 15718.4
atari-games-on-atari-2600-gopherDDQN (tuned) noop
Score: 14840.8
atari-games-on-atari-2600-gopherDuel hs
Score: 20051.4
atari-games-on-atari-2600-gravitarDuel noop
Score: 588.0
atari-games-on-atari-2600-gravitarDuel hs
Score: 297.0
atari-games-on-atari-2600-gravitarDDQN (tuned) noop
Score: 412.0
atari-games-on-atari-2600-gravitarPrior+Duel noop
Score: 238.0
atari-games-on-atari-2600-heroPrior+Duel noop
Score: 21036.5
atari-games-on-atari-2600-heroDDQN (tuned) noop
Score: 20130.2
atari-games-on-atari-2600-heroDuel hs
Score: 15207.9
atari-games-on-atari-2600-heroDuel noop
Score: 20818.2
atari-games-on-atari-2600-ice-hockeyDDQN (tuned) noop
Score: -2.7
atari-games-on-atari-2600-ice-hockeyPrior+Duel noop
Score: -0.4
atari-games-on-atari-2600-ice-hockeyDuel noop
Score: 0.5
atari-games-on-atari-2600-ice-hockeyDuel hs
Score: -1.3
atari-games-on-atari-2600-james-bondDuel noop
Score: 1312.5
atari-games-on-atari-2600-james-bondDuel hs
Score: 835.5
atari-games-on-atari-2600-james-bondPrior+Duel noop
Score: 812.0
atari-games-on-atari-2600-james-bondDDQN (tuned) noop
Score: 1358.0
atari-games-on-atari-2600-kangarooPrior+Duel noop
Score: 1792.0
atari-games-on-atari-2600-kangarooDDQN (tuned) noop
Score: 12992.0
atari-games-on-atari-2600-kangarooDuel hs
Score: 10334.0
atari-games-on-atari-2600-kangarooDuel noop
Score: 14854.0
atari-games-on-atari-2600-krullPrior+Duel noop
Score: 10374.4
atari-games-on-atari-2600-krullDuel hs
Score: 8051.6
atari-games-on-atari-2600-krullDuel noop
Score: 11451.9
atari-games-on-atari-2600-krullDDQN (tuned) noop
Score: 7920.5
atari-games-on-atari-2600-kung-fu-masterDDQN (tuned) noop
Score: 29710.0
atari-games-on-atari-2600-kung-fu-masterPrior+Duel noop
Score: 48375.0
atari-games-on-atari-2600-kung-fu-masterDuel hs
Score: 24288.0
atari-games-on-atari-2600-kung-fu-masterDuel noop
Score: 34294.0
atari-games-on-atari-2600-montezumas-revengeDuel hs
Score: 22.0
atari-games-on-atari-2600-ms-pacmanPrior+Duel noop
Score: 3327.3
atari-games-on-atari-2600-ms-pacmanDDQN (tuned) noop
Score: 2711.4
atari-games-on-atari-2600-ms-pacmanDuel noop
Score: 6283.5
atari-games-on-atari-2600-ms-pacmanDuel hs
Score: 2250.6
atari-games-on-atari-2600-name-this-gameDuel hs
Score: 11185.1
atari-games-on-atari-2600-name-this-gamePrior+Duel noop
Score: 15572.5
atari-games-on-atari-2600-name-this-gameDuel noop
Score: 11971.1
atari-games-on-atari-2600-name-this-gameDDQN (tuned) noop
Score: 10616.0
atari-games-on-atari-2600-phoenixPrior+Duel hs
Score: 63597.0
atari-games-on-atari-2600-pongDDQN (tuned) noop
Score: 20.9
atari-games-on-atari-2600-pongDuel noop
Score: 21.0
atari-games-on-atari-2600-pongDuel hs
Score: 18.8
atari-games-on-atari-2600-pongPrior+Duel noop
Score: 20.9
atari-games-on-atari-2600-private-eyeDuel hs
Score: 292.6
atari-games-on-atari-2600-private-eyePrior+Duel noop
Score: 206.0
atari-games-on-atari-2600-private-eyeDuel noop
Score: 103.0
atari-games-on-atari-2600-private-eyeDDQN (tuned) noop
Score: 129.7
atari-games-on-atari-2600-qbertPrior+Duel noop
Score: 18760.3
atari-games-on-atari-2600-qbertDuel hs
Score: 14175.8
atari-games-on-atari-2600-qbertDDQN (tuned) noop
Score: 15088.5
atari-games-on-atari-2600-qbertDuel noop
Score: 19220.3
atari-games-on-atari-2600-river-raidDuel noop
Score: 21162.6
atari-games-on-atari-2600-river-raidDDQN (tuned) noop
Score: 14884.5
atari-games-on-atari-2600-river-raidPrior+Duel noop
Score: 20607.6
atari-games-on-atari-2600-river-raidDuel hs
Score: 16569.4
atari-games-on-atari-2600-road-runnerDuel hs
Score: 58549.0
atari-games-on-atari-2600-road-runnerDuel noop
Score: 69524.0
atari-games-on-atari-2600-road-runnerPrior+Duel noop
Score: 62151.0
atari-games-on-atari-2600-road-runnerDDQN (tuned) noop
Score: 44127.0
atari-games-on-atari-2600-robotankDDQN (tuned) noop
Score: 65.1
atari-games-on-atari-2600-robotankPrior+Duel noop
Score: 27.5
atari-games-on-atari-2600-robotankDuel hs
Score: 62.0
atari-games-on-atari-2600-robotankDuel noop
Score: 65.3
atari-games-on-atari-2600-seaquestDuel hs
Score: 37361.6
atari-games-on-atari-2600-seaquestDDQN (tuned) noop
Score: 16452.7
atari-games-on-atari-2600-seaquestPrior+Duel noop
Score: 931.6
atari-games-on-atari-2600-seaquestDuel noop
Score: 50254.2
atari-games-on-atari-2600-space-invadersDuel hs
Score: 5993.1
atari-games-on-atari-2600-space-invadersPrior+Duel noop
Score: 15311.5
atari-games-on-atari-2600-space-invadersDDQN (tuned) noop
Score: 2525.5
atari-games-on-atari-2600-space-invadersDuel noop
Score: 6427.3
atari-games-on-atari-2600-star-gunnerPrior+Duel noop
Score: 125117.0
atari-games-on-atari-2600-star-gunnerDuel noop
Score: 89238.0
atari-games-on-atari-2600-star-gunnerDuel hs
Score: 90804.0
atari-games-on-atari-2600-star-gunnerDDQN (tuned) noop
Score: 60142.0
atari-games-on-atari-2600-tennisDuel noop
Score: 5.1
atari-games-on-atari-2600-tennisDDQN (tuned) noop
Score: -22.8
atari-games-on-atari-2600-tennisDuel hs
Score: 4.4
atari-games-on-atari-2600-tennisPrior+Duel noop
Score: 0.0
atari-games-on-atari-2600-time-pilotDDQN (tuned) noop
Score: 8339.0
atari-games-on-atari-2600-time-pilotDuel noop
Score: 11666.0
atari-games-on-atari-2600-time-pilotPrior+Duel noop
Score: 7553.0
atari-games-on-atari-2600-time-pilotDuel hs
Score: 6601.0
atari-games-on-atari-2600-tutankhamDDQN (tuned) noop
Score: 218.4
atari-games-on-atari-2600-tutankhamPrior+Duel noop
Score: 245.9
atari-games-on-atari-2600-tutankhamDuel hs
Score: 48.0
atari-games-on-atari-2600-tutankhamDuel noop
Score: 211.4
atari-games-on-atari-2600-up-and-downDDQN (tuned) noop
Score: 22972.2
atari-games-on-atari-2600-up-and-downDuel noop
Score: 44939.6
atari-games-on-atari-2600-up-and-downPrior+Duel noop
Score: 33879.1
atari-games-on-atari-2600-up-and-downDuel hs
Score: 24759.2
atari-games-on-atari-2600-venturePrior+Duel noop
Score: 48.0
atari-games-on-atari-2600-ventureDuel noop
Score: 497.0
atari-games-on-atari-2600-ventureDDQN (tuned) noop
Score: 98.0
atari-games-on-atari-2600-ventureDuel hs
Score: 200.0
atari-games-on-atari-2600-video-pinballDuel noop
Score: 98209.5
atari-games-on-atari-2600-video-pinballDDQN (tuned) noop
Score: 309941.9
atari-games-on-atari-2600-video-pinballPrior+Duel noop
Score: 479197.0
atari-games-on-atari-2600-video-pinballDuel hs
Score: 110976.2
atari-games-on-atari-2600-wizard-of-worPrior+Duel noop
Score: 12352.0
atari-games-on-atari-2600-wizard-of-worDDQN (tuned) noop
Score: 7492.0
atari-games-on-atari-2600-wizard-of-worDuel hs
Score: 7054.0
atari-games-on-atari-2600-wizard-of-worDuel noop
Score: 7855.0
atari-games-on-atari-2600-zaxxonPrior+Duel noop
Score: 13886.0
atari-games-on-atari-2600-zaxxonDDQN (tuned) noop
Score: 10163.0
atari-games-on-atari-2600-zaxxonDuel noop
Score: 12944.0
atari-games-on-atari-2600-zaxxonDuel hs
Score: 10164.0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp