
Naive question about multiple seeds #95

Open
kfu02 opened this issue Jun 6, 2024 · 4 comments

kfu02 commented Jun 6, 2024

Hi,

I am familiar with MARL in Pytorch, but very new to JAX, so please forgive me if this question is naive.

I see that many of your baselines are parallelized over multiple seeds at once (e.g. here in QMIX or here in transfQMIX). However, when running the baselines I notice that the resulting WandB runs seem to aggregate the seeds together. Is there some way to separate the performance of each seed for plotting purposes (e.g. to report the min/avg/max)? Your paper has several average return curves with some sort of error shading, so I imagine I must be missing something obvious.
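For context, the seed parallelism being asked about looks roughly like the following. This is a sketch, not JaxMARL's actual training code: `train` here is a toy stand-in for a full training run.

```python
import jax

def train(rng):
    # stand-in for a full training run; returns a toy "final return"
    return jax.random.normal(rng)

rng = jax.random.PRNGKey(0)
seeds = jax.random.split(rng, 4)   # one PRNG key per seed
returns = jax.vmap(train)(seeds)   # all 4 seeds run in parallel on one device
```

Because `jax.vmap` vectorises the whole training function, all seeds execute inside a single process, which is why their logs land in a single WandB run.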

amacrutherford (Collaborator) commented Jun 8, 2024

Hey! Thanks for reaching out, and exciting that you are trying out JAX. Off the top of my head, the easiest way with wandb is to run one seed per script and then sweep over the seeds with wandb sweeps. (If you set XLA_PYTHON_CLIENT_PREALLOCATE=false as an environment variable, you can run multiple scripts on one GPU, but this is quite a bit less efficient than vmapping multiple seeds over one device.) Have I missed something @mttga ?
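A sweep over seeds can be expressed as a small config. A minimal sketch, where `train.py` and the `SEED` parameter are placeholders for your actual entry point and its seed argument:

```python
# hypothetical sweep config: "train.py" and "SEED" are placeholders for
# your actual entry point and its seed argument
sweep_config = {
    "program": "train.py",
    "method": "grid",   # enumerate every listed seed exactly once
    "parameters": {
        "SEED": {"values": [0, 1, 2, 3]},
    },
}
```

Each agent picked up by the sweep then becomes its own WandB run, which the UI can group and compare.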

kfu02 (Author) commented Jun 8, 2024

Hi, thanks for the reply!

Okay, so you're saying the answer is simply not to parallelize across seeds, and instead use WandB's tools to aggregate separate single-seed runs together. If I'm understanding correctly, then what is being plotted when I run multiple seeds in parallel? The average across those seeds?

mttga (Collaborator) commented Jun 9, 2024

The parallel runs will plot in the same space, meaning that you will have datapoints from all your runs but you will not be able to distinguish them. To distinguish them you can use an approach like this:

```python
def function(rng):
    # the first element of the PRNG key doubles as a readable seed id
    original_seed = rng[0]

    # ... training code ...

    metrics = ...  # a dictionary of your logging metrics

    def callback(metrics, original_seed):
        # log each metric under a seed-prefixed name so the parallel
        # runs are distinguishable (build a new dict rather than calling
        # metrics.update, which would log every value twice: once
        # unprefixed and once prefixed)
        metrics = {
            f'rng{int(original_seed)}/{k}': v
            for k, v in metrics.items()
        }
        wandb.log(metrics)

    jax.debug.callback(callback, metrics, original_seed)
```

We will include training code like this soon.
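The renaming step in the callback can be checked in isolation; a plain-Python sketch, where `'returns'` is a made-up metric name:

```python
def prefix_metrics(metrics, original_seed):
    # rename each metric to 'rng<seed>/<name>' so every parallel seed
    # gets its own panel in wandb
    return {f'rng{int(original_seed)}/{k}': v for k, v in metrics.items()}

prefixed = prefix_metrics({'returns': 1.5}, 0)
# -> {'rng0/returns': 1.5}
```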

Chulabhaya (Contributor) commented
Hi all! I'd like to clarify one thing about the current JAX setup and the WandB logging. When you run training with multiple seeds (with or without WANDB_LOG_ALL_SEEDS), is there a way to have the aggregated data shown so that WandB displays the mean/std across those runs? Currently, when the seeds are aggregated, WandB treats them as a single run, so it won't show the std shading; and if you log all seeds separately, I'm not sure how to combine them in the WandB interface to see the combined mean/std shading. Thanks!
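One workaround, if WandB's built-in run grouping doesn't give the shading you want, is to export the per-seed curves and compute the mean/std yourself before plotting. A minimal NumPy sketch, where the curves are toy stand-ins for exported data:

```python
import numpy as np

# toy stand-in for per-seed return curves exported from WandB:
# shape (n_seeds, n_logged_steps)
curves = np.array([
    [0.0, 0.5, 1.0],
    [0.1, 0.6, 0.9],
    [0.2, 0.4, 1.1],
])

mean = curves.mean(axis=0)   # average return at each logged step
std = curves.std(axis=0)     # spread across seeds, for the shading
lower, upper = mean - std, mean + std  # band to pass to fill_between
```

The `mean` curve plus the `lower`/`upper` band is what the shaded-average plots in papers typically show.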
