HyperAI超神经

Openai Gym On Pendulum V1

评估指标

Action Repetition
Average Decisions
Mean Reward

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称Action RepetitionAverage DecisionsMean Reward
creating-hierarchical-dispositions-of-needs.807338.6-125.02
temporally-layered-architecture-for-efficient.703262.31-154.92