HyperAI超神经

Atari Games On Atari 2600 Freeway

评估指标

Score

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称Score
asynchronous-methods-for-deep-reinforcement0.1
deep-exploration-via-bootstrapped-dqn33.9
soft-actor-critic-for-discrete-action4.4
dueling-network-architectures-for-deep0.0
dueling-network-architectures-for-deep0.2
asynchronous-methods-for-deep-reinforcement0.1
large-scale-study-of-curiosity-driven32.8
policy-optimization-with-penalized-point21.21
massively-parallel-methods-for-deep10.2
discrete-latent-space-world-models-for29
first-return-then-explore34
prioritized-experience-replay28.9
count-based-exploration-with-neural-density31.7
distributed-prioritized-experience-replay33.7
learning-values-across-many-orders-of33.4
count-based-exploration-in-feature-space-for0.0
evolution-strategies-as-a-scalable31.0
increasing-the-action-gap-new-operators-for31.72
recurrent-experience-replay-in-distributed32.5
the-arcade-learning-environment-an-evaluation22.5
模型 2119.7
unifying-count-based-exploration-and30.48
optimizing-the-neural-architecture-of22
evolving-simple-programs-for-playing-atari28.2
incentivizing-exploration-in-reinforcement27.0
mastering-atari-with-discrete-world-models-133
agent57-outperforming-the-atari-human32.59
dueling-network-architectures-for-deep33.0
deep-reinforcement-learning-with-double-q28.8
prioritized-experience-replay33.7
a-distributional-perspective-on-reinforcement33.9
distributional-reinforcement-learning-with-134
generalized-data-distribution-iteration34
mastering-atari-go-chess-and-shogi-by33.03
generalized-data-distribution-iteration34
curl-contrastive-unsupervised-representations27.9
count-based-exploration-with-the-successor29.5
the-arcade-learning-environment-an-evaluation0.4
human-level-control-through-deep30.3
count-based-exploration-with-neural-density33.0
deep-reinforcement-learning-with-double-q28.2
increasing-the-action-gap-new-operators-for32.3
self-imitation-learning32.2
asynchronous-methods-for-deep-reinforcement0.1
generalized-data-distribution-iteration34
count-based-exploration-in-feature-space-for29.9
dna-proximal-policy-optimization-with-a-dual33
online-and-offline-reinforcement-learning-by33.87
implicit-quantile-networks-for-distributional34
deep-reinforcement-learning-with-double-q26.9
dueling-network-architectures-for-deep33.3
deep-reinforcement-learning-with-double-q30.8
impala-scalable-distributed-deep-rl-with0.00
gdi-rethinking-what-makes-reinforcement34
optimizing-the-neural-architecture-of22
exploration-a-study-of-count-based34.0
train-a-real-world-local-path-planner-in-one33.9
noisy-networks-for-exploration34
the-arcade-learning-environment-an-evaluation19.1