Potential bug in gradient accumulation? #9

dorarad · 2020-03-07T15:39:26Z

For the gradient accumulation case, currently the regularization for the discriminator performs multiple rounds on the same batch of images, instead of averaging the regularization loss over multiple different batches (as is done for the standard discriminator loss).
Is this a bug or is it arranged this way intentionally?

Thanks!

For the gradient accumulation case, fix Discriminator regularization to use a different batch of real images every round.

ghost · 2020-08-06T01:41:29Z

@tkarras will the accumulation bug be addressed?
https://www.reddit.com/r/MachineLearning/comments/i3si0j/d_stylegan2_potential_gradient_accumulation/

dorarad · 2020-08-06T02:30:38Z

While I believe that might be a bug, I tested the difference between with and without loading new data and they were quite small so probably that doesn't affect things significantly. It also doesn't affect the performance at all for the case of using multi-gpu because than there's anyway only one round.

fixed encoder

Update training_loop.py

a39ad02

For the gradient accumulation case, fix Discriminator regularization to use a different batch of real images every round.

igor-sikachyna mentioned this pull request Aug 6, 2020

Potential improvement to the gradient accumulation code #13

Open

patrickxia pushed a commit to patrickxia/stylegan2 that referenced this pull request Jan 10, 2021

Merge pull request NVlabs#9 from xivh/master

ffc1deb

fixed encoder

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Potential bug in gradient accumulation? #9

Potential bug in gradient accumulation? #9

dorarad commented Mar 7, 2020

ghost commented Aug 6, 2020 •

edited by ghost

Loading

dorarad commented Aug 6, 2020

Potential bug in gradient accumulation? #9

Are you sure you want to change the base?

Potential bug in gradient accumulation? #9

Conversation

dorarad commented Mar 7, 2020

ghost commented Aug 6, 2020 • edited by ghost Loading

dorarad commented Aug 6, 2020

ghost commented Aug 6, 2020 •

edited by ghost

Loading