SAC BipedalWalkerHardcore

A sample code used to solve BipedalWalker-v3 & BipedalWalkerHardcore-v3

Requirement

python 3.7
pytorch 1.0.1 [ Warning ! ]
(I ever used v1.7, then I waste a month to deal with it using the same code but without any network changes, please be careful!)
gym 0.13.1

Hyperparameters

Agent uses the following hyperparameters:

gamma=0.99
batch_size=256
lr=5e-4
hidden_size=400
tau=0.005
alpha=0.2
reward_scale = 5 // reward *= reward_scale
capacity=2000000

Techical Report

https://github.com/CoderAT13/BipedalWalkerHardcore-SAC/blob/main/data/BipedalWalkerTest.md

How to use my code?

Train from blank network

$ python main.py --train=1

Train from exist network

$ python main.py --train=1 --load=1

Play with the network

$ python main.py --load=1 --render=1

MyResult

BiliBili: https://www.bilibili.com/video/BV1DK4y1j7Nz/

Bipedalwalker

BipedalwalkerHardcore

Credit

Pranjal Tandon (https://github.com/pranz24).

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.svn		.svn
__pycache__		__pycache__
data		data
imgs		imgs
logs_data_hard1		logs_data_hard1
logs_data_success		logs_data_success
models		models
models_hard1		models_hard1
models_success		models_success
.gitignore		.gitignore
README.md		README.md
_config.yml		_config.yml
main.py		main.py
model.py		model.py
nohup.out		nohup.out
replay_memory.py		replay_memory.py
sac_agent.py		sac_agent.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAC BipedalWalkerHardcore

Requirement

Hyperparameters

Techical Report

How to use my code?

MyResult

Bipedalwalker

BipedalwalkerHardcore

Credit

About

Releases

Packages

Languages

CoderAT13/BipedalWalkerHardcore-SAC

Folders and files

Latest commit

History

Repository files navigation

SAC BipedalWalkerHardcore

Requirement

Hyperparameters

Techical Report

How to use my code?

MyResult

Bipedalwalker

BipedalwalkerHardcore

Credit

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages