HyperAI超神经

Atari Games On Atari 2600 Venture

Evaluation Metrics

Score

Evaluation Results

Performance of each model on this benchmark

Comparison Table
| Model Name | Score |
| --- | --- |
| train-a-real-world-local-path-planner-in-one | 291 |
| deep-reinforcement-learning-with-double-q | 136.0 |
| distributional-reinforcement-learning-with-1 | 43.9 |
| generalized-data-distribution-iteration | 2000 |
| the-arcade-learning-environment-an-evaluation | 66 |
| increasing-the-action-gap-new-operators-for | 198.69 |
| online-and-offline-reinforcement-learning-by | 1731.47 |
| distributed-prioritized-experience-replay | 1813 |
| human-level-control-through-deep | 380.0 |
| exploration-by-self-supervised-exploitation | 2138 |
| count-based-exploration-in-feature-space-for | 1169.2 |
| Model 1 | 20.6 |
| count-based-exploration-with-neural-density | 82.2 |
| deep-exploration-via-bootstrapped-dqn | 212.5 |
| first-return-then-explore | 2281 |
| count-based-exploration-with-neural-density | 48.0 |
| massively-parallel-methods-for-deep | 523.4 |
| a-distributional-perspective-on-reinforcement | 1520.0 |
| deep-reinforcement-learning-with-double-q | 21.0 |
| generalized-data-distribution-iteration | 2035 |
| impala-scalable-distributed-deep-rl-with | 0.00 |
| evolution-strategies-as-a-scalable | 760.0 |
| count-based-exploration-with-the-successor | 1241.8 |
| dueling-network-architectures-for-deep | 48.0 |
| exploration-by-random-network-distillation | 1859 |
| recurrent-experience-replay-in-distributed | 1970.7 |
| count-based-exploration-in-feature-space-for | 0.0 |
| dna-proximal-policy-optimization-with-a-dual | 0 |
| asynchronous-methods-for-deep-reinforcement | 25.0 |
| learning-values-across-many-orders-of | 1172.0 |
| rudder-return-decomposition-for-delayed | 1350 |
| large-scale-study-of-curiosity-driven | 416 |
| prioritized-experience-replay | 94.0 |
| asynchronous-methods-for-deep-reinforcement | 19.0 |
| dueling-network-architectures-for-deep | 497.0 |
| deep-reinforcement-learning-with-double-q | 29.0 |
| mastering-atari-with-discrete-world-models-1 | 2 |
| implicit-quantile-networks-for-distributional | 1318 |
| unifying-count-based-exploration-and | 0.0 |
| dueling-network-architectures-for-deep | 98.0 |
| asynchronous-methods-for-deep-reinforcement | 23.0 |
| deep-reinforcement-learning-with-double-q | 163.0 |
| evolving-simple-programs-for-playing-atari | 0 |
| exploration-by-self-supervised-exploitation | 1787 |
| exploration-a-study-of-count-based | 445.0 |
| policy-optimization-with-penalized-point | 36.33 |
| agent57-outperforming-the-atari-human | 2623.71 |
| noisy-networks-for-exploration | 815 |
| self-imitation-learning | 0 |
| prioritized-experience-replay | 54.0 |
| dueling-network-architectures-for-deep | 200.0 |
| generalized-data-distribution-iteration | 2000 |
| mastering-atari-go-chess-and-shogi-by | 0.40 |
| incentivizing-exploration-in-reinforcement | 0.0 |
| exploration-by-self-supervised-exploitation | 2188 |