
Deterministically get activation_index, fixed indentation, added support for python3 #4

Open · wants to merge 3 commits into base: master

Conversation

@kendricktan:

Hi there, just wanted to say thank you for the blog post and the code example. I noticed that the function compute_rank in finetune.py mutates shared state, namely grad_index, to calculate activation_index.

See:

activation_index = len(self.activations) - self.grad_index - 1

While that's fine on a single GPU, I noticed that it becomes non-deterministic when pruning/training on multiple GPUs.

This pull request solves that issue and also adds support for Python 3.
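
For illustration, a minimal sketch of the counter-based pattern this PR replaces; this is a hedged reconstruction, not the exact finetune.py code:

def compute_rank(self, grad):
    # The index each hook computes depends on the global order in which
    # hooks fire: fine on one GPU, non-deterministic across GPU replicas.
    activation_index = len(self.activations) - self.grad_index - 1
    # ... accumulate filter ranks for self.activations[activation_index] ...
    self.grad_index += 1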

@jacobgil (Owner) left a comment:

Hi, thanks a lot for this.
I will be able to test this over the weekend.
I'm not sure I understand it correctly; I added a comment about part of the new code. Can you please explain how it works?

finetune.py Outdated
for layer, (name, module) in enumerate(self.model.features._modules.items()):
    x = module(x)
    if isinstance(module, torch.nn.modules.conv.Conv2d):
        x.register_hook(self.compute_rank)
@jacobgil (Owner):

self.compute_rank is now a function that returns a function (hook). It looks like the PyTorch hook mechanism will call compute_rank, which will return hook as a function object (but won't run it), and self.filter_ranks won't be computed anywhere.

@kendricktan (Author):

self.compute_rank now returns a function (hook). So when self.compute_rank(activation_index) is called, hook (a partial function that captures the local variable activation_index) is passed in as the callback for register_hook.

So when the gradients are updated, hook is called, but it doesn't need to calculate activation_index because it was already given when self.compute_rank(INDEX) was called.
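
For illustration, a minimal runnable sketch of this mechanism; the PrunerSketch class and the rank value it stores are placeholders, not the actual finetune.py implementation:

import torch

class PrunerSketch:
    def __init__(self):
        self.filter_ranks = {}

    def compute_rank(self, activation_index):
        def hook(grad):
            # activation_index is captured from the enclosing call, so no
            # shared grad_index counter is needed inside the hook.
            self.filter_ranks[activation_index] = grad.abs().mean().item()
        return hook

pruner = PrunerSketch()
x = torch.randn(4, 3, requires_grad=True)
activation = x * 2                                # stand-in for a conv output
activation.register_hook(pruner.compute_rank(0))  # the returned hook is registered
activation.sum().backward()                       # hook fires with the gradient
print(pruner.filter_ranks)                        # {0: <mean abs gradient>}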

@jacobgil (Owner):

Thanks.
But if so, then wasn't the intention to do:
x.register_hook(self.compute_rank(activation_index))
self.activations.append(x)

Otherwise x isn't appended to self.activations and can't be used from within hook, and PyTorch isn't registering the gradient callback to the partial function from self.compute_rank.
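
For context, a hedged, self-contained sketch of the corrected ordering described above (register the hook returned by compute_rank(activation_index), then append x); TinyPruner and its rank formula are placeholders, not the actual finetune.py code:

import torch
import torch.nn as nn

class TinyPruner:
    def __init__(self):
        self.model = nn.Sequential(nn.Conv2d(3, 4, 3), nn.ReLU(), nn.Conv2d(4, 4, 3))
        self.activations = []
        self.filter_ranks = {}

    def compute_rank(self, activation_index):
        def hook(grad):
            activation = self.activations[activation_index]
            # Placeholder rank: mean |activation * gradient| for this layer.
            self.filter_ranks[activation_index] = (activation * grad).abs().mean().item()
        return hook

    def forward(self, x):
        activation_index = 0
        for layer, module in enumerate(self.model):
            x = module(x)
            if isinstance(module, nn.Conv2d):
                x.register_hook(self.compute_rank(activation_index))
                self.activations.append(x)
                activation_index += 1
        return x

pruner = TinyPruner()
out = pruner.forward(torch.randn(1, 3, 16, 16))
out.sum().backward()
print(pruner.filter_ranks)  # one deterministic entry per Conv2d activation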

@kendricktan (Author):

It's hard to explain, but here's a code snippet that shows what partial functions (closures) do.

def f(a):
    def F(b):
        return a + b   # F captures a from the enclosing call to f(a)
    return F

>>> fun = f(10)
>>> fun(3)
13

@jacobgil (Owner) left a comment:

Hey @kendricktan, just wanted to make sure you saw my latest comment.
Does it make sense?


@kendricktan (Author):

Oh whoops, you are completely right: it should register the hook with the partial function and append x to the activations, not the other way around. I should have slept before committing this. I'll change it when I have time, thanks.

@kendricktan (Author):

Fixed in 212f1b5.

@jiayouba120035:

Hello, thank you for the blog post and the code. I ran your code but hit a problem: "python finetune.py --train" shows a test accuracy of about 50%, while the train accuracy is > 95%. I really don't know what's wrong with my setup, so I'm asking for your help. I suspect the data isn't loaded correctly: the test path is /../../test2, the folder "test" is inside the folder test2, and the pictures are in the folder "test". Is the data loaded correctly? I am new to Python. Thank you in advance for your help.

@Aleks1977:

How can you trace the owner from a phone number? Whoever knows, please help with 89635264714, this scoundrel!
