InvSqrtDecay
- class opennmt.schedules.InvSqrtDecay(learning_rate, warmup_steps, initial_learning_rate=0)[source]
Decay based on the reciprocal of the step square root. This corresponds to
inverse_sqrt
in Fairseq and--lr-decay-inv-sqrt
in Marian.During warmup (linear increase of the learning rate):
After warmup:
See also
Inherits from:
keras.src.optimizers.schedules.learning_rate_schedule.LearningRateSchedule