
Fix LAMB optimizer regex parsing #1532

Merged: 7 commits into tensorflow:master on Apr 3, 2020

Conversation

jarednielsen (Contributor)

See my issue at #1530

The LAMB optimizer declares that its exclude_from_weight_decay argument should take in a comma-separated string of regex patterns. However, the code expects a list of regex patterns and instead iterates through each character in the string. Thus nearly every call to _do_use_weight_decay() returns False.

I attempted to pass in a list to circumvent this bug, but this leads to a typeguard error. So there's no easy way around this in the meantime. A similar bug exists for exclude_from_layer_adaption.
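To illustrate the failure mode, here is a minimal, hypothetical sketch of the logic described above (the real method lives on the optimizer class and may differ in detail; writing it as a free function is just for illustration):

```python
import re

def _do_use_weight_decay(param_name, exclude_from_weight_decay):
    # Skip weight decay for any parameter whose name matches an exclude pattern.
    if exclude_from_weight_decay:
        for r in exclude_from_weight_decay:
            # If exclude_from_weight_decay is the string "weight,bias", then `r`
            # is a single character ("w", "e", ...), and almost every parameter
            # name contains at least one of those characters.
            if re.search(r, param_name) is not None:
                return False
    return True

print(_do_use_weight_decay("dense/kernel", "weight,bias"))       # False (bug: "e" matches)
print(_do_use_weight_decay("dense/kernel", ["weight", "bias"]))  # True (intended behavior)
```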

Two proposed fixes, and I'm happy to contribute either:

  • Change the desired datatype from Optional[str] to List[str]. This would be preferred, and it matches the style of other implementations in the TensorFlow repo. See here for a list of examples. This is the current PR; a rough sketch of both options follows this list.
  • Add .split(',') to exclude_from_weight_decay and exclude_from_layer_adaption in the constructor.
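A rough sketch of both options, with made-up helper names purely for illustration (this is not the literal diff in this PR):

```python
from typing import List, Optional, Union

from typeguard import typechecked


# Option 1 (this PR): annotate the argument as Optional[List[str]] so that
# typeguard accepts a list of regex patterns and rejects the old string form.
@typechecked
def normalize_patterns_v1(exclude_from_weight_decay: Optional[List[str]] = None) -> List[str]:
    return exclude_from_weight_decay or []


# Option 2: keep accepting a comma-separated string, but split it once up front
# so that downstream code always iterates over whole patterns.
def normalize_patterns_v2(exclude_from_weight_decay: Optional[Union[str, List[str]]] = None) -> List[str]:
    if isinstance(exclude_from_weight_decay, str):
        return exclude_from_weight_decay.split(",")
    return exclude_from_weight_decay or []


normalize_patterns_v1(["weight", "bias"])   # OK: ["weight", "bias"]
normalize_patterns_v2("weight,bias")        # OK: ["weight", "bias"]
# normalize_patterns_v1("weight,bias")      # typeguard raises a type-check error
```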

@bot-of-gabrieldemarmiesse

@junjiek You are an owner of some files modified in this pull request. Would you kindly review the changes whenever you have the time? Thank you very much.

@gabrieldemarmiesse self-assigned this on Apr 1, 2020
@gabrieldemarmiesse (Member)

Thanks @jarednielsen for this pull request. I'll take a longer look tomorrow to get the big picture; for the moment, here are some quick thoughts written in two minutes:

  • We need to be backward compatible, at least for the near term. We can throw a deprecation warning if we don't want to keep the old behavior in the long term.
  • Being as close as possible to the TF API is very important for the long term.

If there is a way we can fit both in this pull request, that'd be perfect :)

@jarednielsen (Contributor, Author) commented Apr 1, 2020

Thanks for the quick response! I'm all for backwards compatibility. I'm just not seeing how any usage of this parameter could possibly succeed. For example, using

  • exclude_from_weight_decay=["weight", "bias"] would fail because of typeguard.
  • exclude_from_weight_decay="weight,bias" would be iterated over as ["w", "e", "i", "g", "h", "t", ",", "b", "i", "a", "s"]. If any single one of these characters existed in the parameter name, the parameter would be excluded, which is clearly wrong (the snippet after this list makes it concrete).
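A quick interpreter check of that second bullet (the parameter name "my_dense/gamma" is just an example):

```python
>>> import re
>>> list("weight,bias")
['w', 'e', 'i', 'g', 'h', 't', ',', 'b', 'i', 'a', 's']
>>> [c for c in "weight,bias" if re.search(c, "my_dense/gamma")]
['e', 'g', 'a', 's']   # single-character "patterns" match, so the parameter is excluded
```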

What backwards-compatible behavior would we want to keep? I suppose if your regex was literally just a single character and you only had one pattern to match, then that use case would break. So maybe exclude_from_weight_decay="*"? Although in that case they should just set weight_decay=0 and be done with it.

@seanpmorgan (Member) commented Apr 2, 2020

I'm not so sure backwards compatibility is what's needed here, since passing a string variable (as enforced by typeguard) would fail anyway. This isn't so much an improvement as a bug fix.

The model garden is one of the more popular repos depending on addons, so I think it's important we patch this onto 0.8.4.

@seanpmorgan added the "backport r0.8" label (Will backport any PR merged with this label to the r0.8 branch) on Apr 2, 2020
@seanpmorgan (Member)

@jarednielsen would you mind adding a test case for calling the optimizer with these parameters, so this type of thing would be caught in the future, please?

@jarednielsen (Contributor, Author)

@seanpmorgan Sure, added the test!

@@ -401,3 +401,11 @@ def test_get_config(self):
opt = lamb.LAMB(1e-4)
config = opt.get_config()
self.assertEqual(config["learning_rate"], 1e-4)

def test_exclude_weight_decay(self):
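For context, a hedged sketch of what such a test might look like (variable names and exact assertions are assumptions, not necessarily the code merged here):

```python
def test_exclude_weight_decay(self):
    # Patterns are now passed as a list of regexes rather than one string.
    opt = lamb.LAMB(1e-4, weight_decay_rate=0.01, exclude_from_weight_decay=["var1"])
    self.assertTrue(opt._do_use_weight_decay("var0"))          # no pattern matches
    self.assertFalse(opt._do_use_weight_decay("var1"))         # matches "var1"
    self.assertFalse(opt._do_use_weight_decay("var1_weight"))  # also matches "var1"
```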
Member:
Could you add a test for exclude_from_layer_adaption as well please?

Contributor (Author):
Done

@gabrieldemarmiesse (Member) left a comment:
Thanks a lot for the pull request! That's some great investigation work and a great solution!

@gabrieldemarmiesse merged commit ce16e62 into tensorflow:master on Apr 3, 2020
jrruijli pushed a commit to jrruijli/addons that referenced this pull request on Dec 23, 2020:
* Fix type for LAMB optimizer exclude_from_weight_decay

* Add import

* Add optional wrapper

* Add test

* Layer adaption test

* Typo
Labels: backport r0.8, cla: yes, optimizers
5 participants