OpenAI Gym on Walker2d-v2

Metrics: Action Repetition, Average Decisions, Mean Reward

Evaluation Results

Performance of each model on this benchmark

Model	Action Repetition	Average Decisions	Mean Reward	Paper Title	Repository
AWR	-	-	5813	Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
TLA	0.4745	513.12	3878.41	Optimizing Attention and Cognitive Control Costs Using Temporally-Layered Architectures