Hello everyone, has anyone noticed a drop in performance (higher test and validation loss) when training with dtype='float32'? I'm training on the Shakespeare dataset with the train_shakespeare_char config file.
I have not changed anything from the original repo except dtype.
It seems related to the use of torch.amp.autocast, but I don't understand why increasing precision from bfloat16 to float32 would cause a drop in performance.
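For context, here is a minimal sketch of how the dtype setting and autocast context are typically wired together in this kind of training script (variable names here are my own and may not match the repo exactly):

```python
import torch
from contextlib import nullcontext

# Sketch with assumed names, not verbatim from the repo:
dtype = 'float32'  # the only setting changed from the default
device_type = 'cuda'
ptdtype = {'float32': torch.float32,
           'bfloat16': torch.bfloat16,
           'float16': torch.float16}[dtype]

# With float32, autocast performs no downcasting, so the forward pass
# runs entirely in full precision; with bfloat16/float16 it casts
# matmuls and other eligible ops to the lower-precision type.
ctx = nullcontext() if device_type == 'cpu' else torch.amp.autocast(
    device_type=device_type, dtype=ptdtype)

with ctx:
    # forward pass and loss computation would go here
    pass
```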
Thank you for your help!
This is just speculation, but maybe the lower precision of bfloat16 introduces some form of regularization noise into the model; given the small size of the dataset and the model, that may actually help it generalize better.