Evolution Strategies as a Scalable Alternative to Reinforcement Learning

Tim Salimans; Jonathan Ho; Xi Chen; Szymon Sidor; Ilya Sutskever

Abstract

We explore the use of Evolution Strategies (ES), a class of black-box optimization algorithms, as an alternative to popular MDP-based RL techniques such as Q-learning and Policy Gradients. Experiments on MuJoCo and Atari show that ES is a viable solution strategy that scales extremely well with the number of CPUs available: by using a novel communication strategy based on common random numbers, our ES implementation only needs to communicate scalars, making it possible to scale to over a thousand parallel workers. This allows us to solve 3D humanoid walking in 10 minutes and obtain competitive results on most Atari games after one hour of training. In addition, we highlight several advantages of ES as a black-box optimization technique: it is invariant to action frequency and delayed rewards, tolerant of extremely long horizons, and does not need temporal discounting or value function approximation.
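
The scalar-only communication described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names (`perturbation`, `objective`, `es_step`, `train`), the toy quadratic objective standing in for an episode return, the hyperparameters, and the mean-return baseline are all assumptions made for illustration. The point it tries to show is that when every worker knows every other worker's random seed, each perturbation vector can be regenerated locally, so only one scalar return per worker would need to cross the network.

```python
import random


def perturbation(seed, dim):
    """Regenerate a Gaussian noise vector from an integer seed (hypothetical helper)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


def objective(theta):
    """Toy stand-in for an episode return (an assumption): maximized at (3, -2)."""
    return -((theta[0] - 3.0) ** 2 + (theta[1] + 2.0) ** 2)


def es_step(theta, seeds, sigma, alpha):
    """One ES update in which workers would exchange only scalar returns."""
    dim = len(theta)

    # Worker phase: each simulated worker perturbs theta with noise derived
    # from its own seed and reports a single scalar return.
    returns = []
    for seed in seeds:
        eps = perturbation(seed, dim)
        returns.append(objective([t + sigma * e for t, e in zip(theta, eps)]))

    # Update phase: the noise vectors are rebuilt from the known seeds, so the
    # scalars above are the only values that would need to be communicated.
    # Subtracting the mean return is a simple variance-reduction choice made
    # for this sketch.
    mean_return = sum(returns) / len(returns)
    grad = [0.0] * dim
    for seed, ret in zip(seeds, returns):
        eps = perturbation(seed, dim)
        for i in range(dim):
            grad[i] += (ret - mean_return) * eps[i]

    n = len(seeds)
    return [t + alpha * g / (n * sigma) for t, g in zip(theta, grad)]


def train(iterations=300, workers=50, sigma=0.1, alpha=0.05):
    """Run the toy optimization; workers are simulated sequentially here."""
    theta = [0.0, 0.0]
    for it in range(iterations):
        # Seeds are a deterministic function of iteration and worker index,
        # so every worker could compute all of them without communication.
        seeds = [it * workers + w for w in range(workers)]
        theta = es_step(theta, seeds, sigma, alpha)
    return theta


if __name__ == "__main__":
    print(train())
```

In this sketch the workers run one after another in a single process. In a distributed setting, the loop in the worker phase would run in parallel and the `returns` list would be the only data broadcast between machines.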

Code Repositories

cesch97/NeuroEvolution
patniemeyer/ga-autoencoder (pytorch)
rasmusbergpalm/evostrat (pytorch)
silvialuu/DRL-2018 (pytorch)
czen88/qtrader (tf)
nnaisense/pgpelib (pytorch)
alisidd/Evolution-Strategies (tf)
evaboost/evaboost
aspk/space_battle
shakti365/nes
fiberleif/evolution-strategies (tf)
enajx/ES (pytorch)
ShangtongZhang/DistributedES (pytorch)
susanwe/ope_worldmodels
neuro-soup/es-torch (jax)
FlixCoder/rust-es-optimizer
openai/evolution-strategies-starter (official, tf)
atgambardella/pytorch-es (pytorch)
neilsgp/RL-Algorithms

Benchmarks

| Benchmark | Methodology | Score |
| --- | --- | --- |
| atari-games-on-atari-2600-alien | ES FF (1 hour) noop | 994.0 |
| atari-games-on-atari-2600-amidar | ES FF (1 hour) noop | 112.0 |
| atari-games-on-atari-2600-assault | ES FF (1 hour) noop | 1673.9 |
| atari-games-on-atari-2600-asterix | ES FF (1 hour) noop | 1440 |
| atari-games-on-atari-2600-asteroids | ES FF (1 hour) noop | 1562.0 |
| atari-games-on-atari-2600-atlantis | ES FF (1 hour) noop | 1267410.0 |
| atari-games-on-atari-2600-bank-heist | ES FF (1 hour) noop | 225.0 |
| atari-games-on-atari-2600-battle-zone | ES FF (1 hour) noop | 16600.0 |
| atari-games-on-atari-2600-beam-rider | ES FF (1 hour) noop | 744.0 |
| atari-games-on-atari-2600-berzerk | ES FF (1 hour) noop | 686.0 |
| atari-games-on-atari-2600-bowling | ES FF (1 hour) noop | 30 |
| atari-games-on-atari-2600-boxing | ES FF (1 hour) noop | 49.8 |
| atari-games-on-atari-2600-breakout | ES FF (1 hour) noop | 9.5 |
| atari-games-on-atari-2600-centipede | ES FF (1 hour) noop | 7783.9 |
| atari-games-on-atari-2600-chopper-command | ES FF (1 hour) noop | 3710.0 |
| atari-games-on-atari-2600-crazy-climber | ES FF (1 hour) noop | 26430.0 |
| atari-games-on-atari-2600-demon-attack | ES FF (1 hour) noop | 1166.5 |
| atari-games-on-atari-2600-double-dunk | ES FF (1 hour) noop | 0.2 |
| atari-games-on-atari-2600-enduro | ES FF (1 hour) noop | 95.0 |
| atari-games-on-atari-2600-fishing-derby | ES FF (1 hour) noop | -49.0 |
| atari-games-on-atari-2600-freeway | ES FF (1 hour) noop | 31.0 |
| atari-games-on-atari-2600-frostbite | ES FF (1 hour) noop | 370.0 |
| atari-games-on-atari-2600-gopher | ES FF (1 hour) noop | 582.0 |
| atari-games-on-atari-2600-gravitar | ES FF (1 hour) noop | 805.0 |
| atari-games-on-atari-2600-ice-hockey | ES FF (1 hour) noop | -4.1 |
| atari-games-on-atari-2600-kangaroo | ES FF (1 hour) noop | 11200.0 |
| atari-games-on-atari-2600-krull | ES FF (1 hour) noop | 8647.2 |
| atari-games-on-atari-2600-name-this-game | ES FF (1 hour) noop | 4503.0 |
| atari-games-on-atari-2600-pong | ES FF (1 hour) noop | 21.0 |
| atari-games-on-atari-2600-private-eye | ES FF (1 hour) noop | 100.0 |
| atari-games-on-atari-2600-qbert | ES FF (1 hour) noop | 147.5 |
| atari-games-on-atari-2600-river-raid | ES FF (1 hour) noop | 5009.0 |
| atari-games-on-atari-2600-road-runner | ES FF (1 hour) noop | 16590.0 |
| atari-games-on-atari-2600-robotank | ES FF (1 hour) noop | 11.9 |
| atari-games-on-atari-2600-seaquest | ES FF (1 hour) noop | 1390.0 |
| atari-games-on-atari-2600-space-invaders | ES FF (1 hour) noop | 678.5 |
| atari-games-on-atari-2600-star-gunner | ES FF (1 hour) noop | 1470.0 |
| atari-games-on-atari-2600-tennis | ES FF (1 hour) noop | -4.5 |
| atari-games-on-atari-2600-time-pilot | ES FF (1 hour) noop | 4970.0 |
| atari-games-on-atari-2600-tutankham | ES FF (1 hour) noop | 130.3 |
| atari-games-on-atari-2600-up-and-down | ES FF (1 hour) noop | 67974.0 |
| atari-games-on-atari-2600-venture | ES FF (1 hour) noop | 760.0 |
| atari-games-on-atari-2600-video-pinball | ES FF (1 hour) noop | 22834.8 |
| atari-games-on-atari-2600-wizard-of-wor | ES FF (1 hour) noop | 3480.0 |
| atari-games-on-atari-2600-zaxxon | ES FF (1 hour) noop | 6380.0 |