Skip to content

jihan1218/brax

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Customized PPO Training with Parameter Initialization from Previous Training

This lightly modified version of the brax.training.agents.ppo.train module allows users to continue training their model by initializing parameters from a previous training session. By providing the previous_params parameter, users can load the parameters from a previous run and start training from that point onwards, rather than starting with randomized parameters every time.

Requirement

You need to either clone this repo or copy brax.training.agents.ppo.train and brax.training.agents.ppo.networks files and replace into your python library.

Usage

After your first training train_fn, save your parameters. (Here you must run your initial training using the updated files because the current output params from the training only contains policy.)

from brax.training.agents.ppo import train as ppo

train_fn = functools.partial(
    ppo.train, num_evals=10, ... YOUR OWN PARAMETERS)

make_inference_fn, params, _= train_fn(environment=env, progress_fn=progress)
model.save_params(model_file_path, params)

To continue your training from the previous run, simply load the parameters and feed them into previous_params.

previous_params = model.load_params(model_file_path)

train_fn = functools.partial(
    ppo.train, num_evals=100, ... , previous_params = previous_params)

make_inference_fn, params, _= train_fn(environment=env, progress_fn=progress)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published