Lr scheduler
SchedulerOutput
Bases: TypedDict
Output of the scheduler method.
Source code in bionemo/llm/model/lr_scheduler.py
33 34 35 36 37 38 |
|
WarmupAnnealDecayHold
Bases: _LRScheduler
Warmup Anneal Decay Hold learning rate scheduler.
Source code in bionemo/llm/model/lr_scheduler.py
41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 |
|
__init__(optimizer, *, warmup_steps=None, max_steps=None, max_lr=None, min_lr=4e-05, anneal_percentage=0.1, last_epoch=-1)
Initializes the WarmupAnnealDecayHold learning rate scheduler.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
optimizer
|
MegatronOptimizerModule
|
Optimizer to apply the learning rate scheduler. |
required |
warmup_steps
|
int
|
Number of steps for the linear warm-up. |
None
|
max_steps
|
int
|
Total number of training steps. |
None
|
max_lr
|
float
|
Peak learning rate to be achieved after warm-up. |
None
|
min_lr
|
float
|
Minimum learning rate. |
4e-05
|
anneal_percentage
|
float
|
Percentage of the max_lr to hold after decay. |
0.1
|
last_epoch
|
int
|
The index of the last epoch. |
-1
|
Source code in bionemo/llm/model/lr_scheduler.py
44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 |
|
get_lr()
Get the learning rate at the current step.
Source code in bionemo/llm/model/lr_scheduler.py
78 79 80 81 82 83 84 85 86 87 88 |
|
WarmupAnnealDecayHoldScheduler
Bases: LRSchedulerModule
Warmup Policy Learning Rate Scheduler.
Source code in bionemo/llm/model/lr_scheduler.py
91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 |
|
__init__(warmup_steps=2000, max_steps=500000, max_lr=0.0004, min_lr=4e-05, anneal_percentage=0.1, interval='step', frequency=1, monitor='val_loss')
Initializes the WarmupAnnealDecayHoldScheduler.
Source code in bionemo/llm/model/lr_scheduler.py
94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 |
|
scheduler(model, optimizer)
Returns the scheduler output.
Source code in bionemo/llm/model/lr_scheduler.py
116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 |
|