Upgrade torch and correct dim mismatch #3

Open · wants to merge 1 commit into main

Conversation

@KE7 commented Apr 16, 2023

No description provided.

@anentropic commented Apr 18, 2023

+1 on this; maximum versions of dependencies shouldn't be specified in library code (unless the library is known to be incompatible with a specific version). It just makes life awkward for consumers of the library.
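
As an illustration only (placeholder version numbers, not this repository's actual pins), the difference looks like this in a setup.py fragment:

# Hypothetical install_requires fragment: pin a minimum version, but leave
# the maximum open unless a known incompatibility forces an upper bound.
install_requires = [
    "torch>=1.10",           # open-ended: consumers can use newer torch releases
    # "torch>=1.10,<=1.11",  # an upper pin like this blocks upgrades for consumers
]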

@@ -3,6 +3,7 @@
 # Copyright (C) 2022 Apple Inc. All Rights Reserved.
 #

+import einops

This looks like a third-party dependency that needs adding to requirements.txt and setup.py?
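
If so, a sketch of the corresponding declarations (not the actual diff in this PR):

# Sketch only: mirror the new import in both dependency lists.
# requirements.txt would gain the line:
#     einops
# and setup.py's install_requires would gain:
install_requires = ["einops"]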

@mikowals

The shape mismatch errors should be fixed in the models, not in the tests. The _register_load_state_dict_pre_hook() logic is not correct: it is currently applied only in the base DistilBert model, which skips the pre_classifier and classifier weights added by the other models. Until the model __init__s are fixed, it is correct to leave the test failing.

test_distilbert.py will pass by adding a pre-hook in DistilBertForSequenceClassification's __init__:

self._register_load_state_dict_pre_hook(linear_to_conv2d_map)

I think the same needs to be done for all the models that create pre_classifier and classifier weights. At least, that appears to be the intention of linear_to_conv2d_map, which expects to handle layers with classifier.weights in the name.
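
For illustration, a minimal sketch of the registration described above. It uses a subclass so the existing layer setup does not have to be reproduced here; the ane_transformers.huggingface.distilbert import path is an assumption about where these names live.

# Illustrative sketch, not the repository's actual fix: register the
# Linear -> Conv2d weight-remapping hook on the model that owns the
# pre_classifier/classifier weights, not only on the base DistilBert model.
# The import path below is an assumption.
from ane_transformers.huggingface.distilbert import (
    DistilBertForSequenceClassification,
    linear_to_conv2d_map,
)

class PatchedDistilBertForSequenceClassification(DistilBertForSequenceClassification):
    def __init__(self, config):
        super().__init__(config)
        # Remap incoming nn.Linear-layout checkpoint weights (including the
        # classifier weights) to the nn.Conv2d layout these models expect.
        self._register_load_state_dict_pre_hook(linear_to_conv2d_map)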
