Enhancement: Implement Cyclic Learning Rate and Step-wise Learning Rate Scheduler #211

Franklalalala · 2024-09-27T05:47:27Z

Our current learning rate scheduler has limitations that impact training efficiency and flexibility, especially for large datasets:

It lacks support for Cyclic Learning Rate (CLR), which has shown promising fast-convergence ability, particularly in the initial training stages.
The current scheduler only functions on an epoch-scale, which may be inappropriate for extremely large datasets. (e.g. QH9)

I propose the following enhancements:

Add CLR support in the train_options/lr_scheduler field
Implement common CLR policies (e.g., triangular, triangular2, exp_range)
Add necessary arguments for CLR configuration (e.g., base_lr, max_lr, step_size)

Modify the learning rate scheduler to support updates on a per-step basis
Add a new option in train_options/optimizer field to switch between epoch-wise and step-wise updates
Implement necessary logic to track global steps and update learning rate accordingly

I will propose a PR as soon as possiable!

The text was updated successfully, but these errors were encountered:

Provide feedback