
Add functionality to select heads to evaluate error tables on and perform dry runs. #836

Open · wants to merge 7 commits into base: main
Conversation

IsaacParker30 (Contributor)
This PR adds two new arguments:

--eval_heads
This allows the user to specify which heads to evaluate (i.e. print an error table for) at the end of training. If not set, all heads are evaluated by default.
Usage: specify the heads you want to evaluate as a single string, separated by commas. For example:
--eval_heads='default,pt_head,head3'
An example use case is replay fine-tuning, where the user may not want to evaluate the large foundation-model dataset on the pt_head.
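The comma-separated parsing described above could look roughly like this. This is a minimal sketch, not the actual implementation in this PR; the helper `parse_eval_heads` is a hypothetical name, and only the `--eval_heads` flag itself comes from the PR:

```python
import argparse

def parse_eval_heads(value: str) -> list[str]:
    # Hypothetical helper: split "default,pt_head,head3" into a list of head names
    return [h.strip() for h in value.split(",") if h.strip()]

parser = argparse.ArgumentParser()
# default=None is taken to mean "evaluate all heads" (the PR's default behaviour)
parser.add_argument("--eval_heads", type=parse_eval_heads, default=None)

args = parser.parse_args(["--eval_heads=default,pt_head,head3"])
print(args.eval_heads)  # ['default', 'pt_head', 'head3']
```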

--dry_run
Adding this argument stops the run_train.py script just before model training begins (i.e. just before the tools.train() call). This allows the user to check that their parameters are set correctly before starting an expensive training run.
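The dry-run guard can be sketched as an early return after setup and before the training call. This is an illustrative sketch, not the PR's actual code; `main` and `train_fn` are hypothetical stand-ins for run_train.py's entry point and tools.train():

```python
import argparse

def main(argv, train_fn):
    # Hypothetical entry point mimicking run_train.py's flow
    parser = argparse.ArgumentParser()
    parser.add_argument("--dry_run", action="store_true")
    args = parser.parse_args(argv)

    # ... model, dataloaders and optimizer would be configured here (cheap) ...

    if args.dry_run:
        print("Dry run: configuration finished, stopping before training.")
        return "dry_run"      # the real script stops here, before tools.train()
    return train_fn()         # the expensive training loop

result = main(["--dry_run"], train_fn=lambda: "trained")
```

The point of placing the check after all configuration but before `train_fn()` is that every argument-parsing and setup error surfaces during the dry run, while no GPU time is spent.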

I have tested:

  • That the dry run works.
  • That individual heads can be deselected.
  • That the code evaluates all heads if none are specified.
  • That the code stops if the user specifies a head to evaluate on that doesn't exist.
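The last check above (failing fast on an unknown head) could be implemented along these lines. This is a sketch under assumed names; `validate_eval_heads`, `available_heads`, and the error message are illustrative, not the PR's actual code:

```python
def validate_eval_heads(eval_heads, available_heads):
    # Hypothetical validator: raise if the user asks for a head the model lacks
    if eval_heads is None:
        return sorted(available_heads)   # default: evaluate every head
    unknown = [h for h in eval_heads if h not in available_heads]
    if unknown:
        raise ValueError(
            f"Unknown head(s) {unknown}; available heads are {sorted(available_heads)}"
        )
    return eval_heads

print(validate_eval_heads(None, {"default", "pt_head"}))  # ['default', 'pt_head']
```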

Let me know if any changes are needed.

@vue1999 vue1999 self-assigned this Feb 27, 2025
vue1999 (Collaborator) left a comment:


Please fix the pre-commit check suggestions, otherwise looks good.

vue1999 previously approved these changes Feb 28, 2025
@vue1999 vue1999 removed their assignment Feb 28, 2025
alinelena (Contributor):

@IsaacParker30 this addresses some of the points in #839; I should have checked this PR before writing that issue. I wonder if it makes sense to also implement the first part of #839 here.

alinelena (Contributor):

One small point: if you do not need the head stats at the end, you may not want to print them during training either.

alinelena (Contributor):

@IsaacParker30 this will need a small rebase


3 participants