
High loss and low bleu-4 for training #192

Open
loserlulin9 opened this issue Feb 16, 2023 · 7 comments

Comments

@loserlulin9

When I train a new model on the Flickr8k and Flickr30k datasets in my environment, I find that the training loss is too high (about 10) and the BLEU-4 is too low (about 2.4e-232) after 20 epochs. It is also very strange that the parameter epochs_since_improvement is 20. I didn't change the train.py code except to fix some small bugs. How can I improve this? Is anyone else having the same problem? Thanks!

@AndreiMoraru123

AndreiMoraru123 commented Feb 16, 2023

What exactly have you changed in the code?

Be wary of erasing things like

global best_bleu4, epochs_since_improvement, checkpoint, start_epoch, fine_tune_encoder, data_name, word_map

PEP 8 linters will mark those as warnings, but here they have a good use.
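
For anyone hitting this later, a minimal standalone sketch (assumed variable names, not the tutorial's exact code) of why erasing that statement breaks training state: without `global`, the assignments bind new function-local variables, so the module-level counters never change.

```python
best_bleu4 = 0.0
epochs_since_improvement = 0

def update_tracking(recent_bleu4: float) -> None:
    # Without this `global` line, the assignments below would create
    # function-local variables and the module-level state would stay frozen.
    global best_bleu4, epochs_since_improvement
    if recent_bleu4 > best_bleu4:
        best_bleu4 = recent_bleu4
        epochs_since_improvement = 0
    else:
        epochs_since_improvement += 1

update_tracking(0.12)  # improvement: counter resets to 0
update_tracking(0.10)  # no improvement: counter becomes 1
print(best_bleu4, epochs_since_improvement)  # 0.12 1
```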

@loserlulin9
Author

I just changed the code from "scores, _ = pack_padded_sequence(scores, decode_lengths, batch_first=True)" to "scores = pack_padded_sequence(scores, decode_lengths, batch_first=True).data" to debug. I also changed some data parameters at the beginning of train.py, but I don't think that has much influence. I didn't change the code with the global variables. Do you know how to make the loss converge? Should I lower the learning rate?
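
For context, a minimal sketch of the API difference with dummy tensors (the shapes here are illustrative, not the real model's outputs):

```python
import torch
from torch.nn.utils.rnn import pack_padded_sequence

scores = torch.randn(2, 3, 5)  # (batch, max_decode_len, vocab_size), dummy logits
decode_lengths = [3, 2]        # valid (unpadded) timesteps per sequence

# Older PyTorch: PackedSequence unpacked like a 2-tuple, so this worked:
#   scores, _ = pack_padded_sequence(scores, decode_lengths, batch_first=True)
# Newer PyTorch: PackedSequence has four fields, so take .data explicitly:
packed = pack_padded_sequence(scores, decode_lengths, batch_first=True)
scores = packed.data           # flat (sum(decode_lengths), vocab_size) tensor
print(scores.shape)            # torch.Size([5, 5])
```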

@AndreiMoraru123

Have you tried this fix instead?

@loserlulin9
Author

> Have you tried this fix instead?

Yeah, I just deleted the '_', but the cross entropy loss must accept two tensor parameters, so I added '.data' to the end of that line.
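
To make that concrete, a small self-contained sketch (dummy shapes) of feeding the packed .data tensors to the loss:

```python
import torch
from torch import nn
from torch.nn.utils.rnn import pack_padded_sequence

criterion = nn.CrossEntropyLoss()

scores = torch.randn(2, 3, 5)          # (batch, max_len, vocab_size), dummy logits
targets = torch.randint(0, 5, (2, 3))  # (batch, max_len), dummy token ids
decode_lengths = [3, 2]

# Pack both so the padded timesteps are dropped, then pass the flat .data
# tensors: CrossEntropyLoss expects plain tensors, not PackedSequence objects.
scores = pack_padded_sequence(scores, decode_lengths, batch_first=True).data
targets = pack_padded_sequence(targets, decode_lengths, batch_first=True).data
loss = criterion(scores, targets)      # scores: (N, vocab_size), targets: (N,)
```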

@AndreiMoraru123

That's true. They should be the same in the loss when using .data. Curious, is your loss just not decreasing, or is it getting worse?

@loserlulin9
Author

> That's true. They should be the same in the loss when using .data. Curious, is your loss just not decreasing, or is it getting worse?

My train.py runs, but the loss just doesn't decrease.

@Kevinskt

I changed the code to "scores = pack_padded_sequence(scores, decode_lengths, batch_first=True)[0]", because the new PyTorch version requires this. After making that change, I didn't run into your situation.
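
(For anyone comparing the two fixes: indexing with [0] and reading .data give the same tensor, since PackedSequence is a namedtuple. A quick check:)

```python
import torch
from torch.nn.utils.rnn import pack_padded_sequence

packed = pack_padded_sequence(torch.randn(2, 3, 5), [3, 2], batch_first=True)
assert torch.equal(packed[0], packed.data)  # both are the flat data tensor
```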
