Generalizes #260 from @karpathy to accept arbitrary L_p norms.
A few remarks:
Maybe LpNormalize or Normalize would be a better name?
Only accepts 1D/2D inputs. @nicholas-leonard proposed adding a dim parameter to make it equivalent to torch.norm. Is it worth it given that we have SpatialBatchNormalization?
updateGradInput is quite memory-consuming for large d.
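For concreteness, here is a rough sketch of what the forward pass computes for a 2D (batch x d) input; the function name and the eps guard are illustrative, not the PR's actual code:

```lua
require 'torch'

-- Illustrative only: divide each row of a B x d input by its L_p norm.
-- `eps` (an assumption, not part of the PR) guards against division by zero.
local function lpNormalizeForward(input, p, eps)
   eps = eps or 1e-10
   -- row-wise norm: (sum_j |x_ij|^p)^(1/p), size B x 1
   local norm = input:clone():abs():pow(p):sum(2):pow(1/p):add(eps)
   return torch.cdiv(input, norm:expandAs(input))
end
```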
@szagoruyko Maybe it's not too difficult to extend it to work for other dimensions, by viewing the input as 2D with the last dimension being the one to normalize, but one also has to take care of batched vs. non-batched inputs. Maybe we could add a setNumInputDims function, as in nn.View?
I'll spend some more time trying to make it more generic.
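To picture the reshaping idea mentioned above, a hedged sketch (the helper name is made up, and the setNumInputDims / non-batched handling is deliberately left out):

```lua
require 'torch'

-- Sketch: fold all leading dimensions into a batch dimension, normalize the
-- last dimension, then restore the original shape. Assumes a dense input.
local function normalizeLastDim(input, p)
   local d    = input:size(input:dim())
   local rows = input:nElement() / d
   local flat = input:contiguous():view(rows, d)            -- view as 2D
   local norm = flat:clone():abs():pow(p):sum(2):pow(1/p)   -- rows x 1
   local out  = torch.cdiv(flat, norm:expandAs(flat))
   return out:viewAs(input)
end
```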
Why is it obvious that the name should be Normalize and not Norm? As far as I can tell, almost all operations from torch are ported to nn layers without name changes. E.g., analogous to the max operation there is nn.Max, so why is it obvious that the norm operation should be nn.Normalize? Shouldn't we have nn.Maximize then? My first instinct would be to stick with the current naming for consistency.
Good point. I'd almost suggest that there should be both an nn.Norm that does exactly what norm does, and then an nn.Normalize that also does a div right afterwards. But perhaps that gets a bit too hairy then :)
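Just to spell out the distinction being discussed (a throwaway sketch for a single 1D vector, not a proposed API):

```lua
require 'torch'

-- nn.Norm-style: just the scalar norm, i.e. what torch.norm already gives you.
local function norm(x, p)      return x:norm(p) end
-- nn.Normalize-style: the norm followed by a division, yielding a unit-L_p-norm vector.
local function normalize(x, p) return x / x:norm(p) end
```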
It's good. I just need to squash the commits.
I didn't have time to add a dimension parameter though; it would complicate the logic a bit because of the setNumInputDims function. But it could be added later if needed.
The backprop of this module needs a lot of memory. One reason might be creating the eyeExpand matrix and then doing the multiplication. In my case, with a batch size of 64 and an input dimension of 4800, two Normalize layers run out of memory on a 4 GB GPU. Any ideas for implementing a more space-efficient Normalize layer?
@ffmpbgrnn here is a version of Normalize which uses much less memory (the extra memory no longer depends on the batch size). It should be slower on GPU. The tests pass, so it should be fine. Use it with fastMode(false). fmassa@015ba9c
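For intuition about where the savings can come from: for p = 2 the backward pass has a closed form that only needs B x d and B x 1 temporaries, so nothing of size B x d x d (the eyeExpand path) has to be materialized. This is only a sketch of that identity, not the linked patch:

```lua
require 'torch'

-- Sketch (p = 2 only): for y = x / ||x||_2, row-wise,
--   gradInput = gradOutput / n - x * <x, gradOutput> / n^3,   with n = ||x||_2.
-- Everything here is B x d or B x 1; no B x d x d Jacobian is ever built.
local function l2NormalizeBackward(input, gradOutput)
   local n   = input:norm(2, 2)                      -- B x 1 row norms
   local dot = torch.cmul(input, gradOutput):sum(2)  -- B x 1 row-wise <x, gradOutput>
   local gradInput = torch.cdiv(gradOutput, n:expandAs(input))
   local scale = torch.cdiv(dot, n:clone():pow(3))   -- B x 1
   gradInput:add(-1, torch.cmul(input, scale:expandAs(input)))
   return gradInput
end
```

For general p the middle term becomes sign(x) * |x|^(p-1) * <x, gradOutput> / n^(p+1), but the memory argument is the same.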
@ffmpbgrnn so, does this patch work fine for you? Is it much slower than the previous version?
Maybe we could push this simplified version to master (taking out the faster mode to keep things simple)?
cc @soumith
cc: @bamos