Commit

Update learning rate for #116 (#117)
johnml1135 authored Nov 26, 2024
1 parent bd8707f commit 7eeea95
Showing 1 changed file with 4 additions and 2 deletions.
6 changes: 4 additions & 2 deletions machine/jobs/settings.yaml
@@ -8,15 +8,17 @@ default:
 train_params:
 do_train: true
 optim: adamw_torch
-warmup_steps: 4000
+warmup_steps: 1000
 per_device_train_batch_size: 16
 gradient_accumulation_steps: 4
 label_smoothing_factor: 0.2
 group_by_length: true
 gradient_checkpointing: true
+lr_scheduler_type: cosine
+learning_rate: 0.0002
 fp16: true
 save_strategy: no
-max_steps: 20000
+max_steps: 5000
 generate_params:
 device: 0
 num_beams: 2
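
To make the effect of this change easier to see, here is a minimal sketch of the learning-rate curve the new values imply. It assumes `cosine` means linear warmup from zero followed by a single half-cosine decay to zero at `max_steps`, which is the usual shape for this scheduler name; the commit itself does not confirm how `train_params` is consumed. The function `lr_at` and the constant names are hypothetical and exist only for illustration.

```python
import math

# Values copied from the new side of the diff above.
LEARNING_RATE = 0.0002
WARMUP_STEPS = 1000
MAX_STEPS = 5000
PER_DEVICE_TRAIN_BATCH_SIZE = 16
GRADIENT_ACCUMULATION_STEPS = 4


def lr_at(step: int) -> float:
    """Hypothetical helper: learning rate at an optimizer step.

    Assumes linear warmup over WARMUP_STEPS, then a half-cosine
    decay that reaches zero at MAX_STEPS.
    """
    if step < WARMUP_STEPS:
        return LEARNING_RATE * step / WARMUP_STEPS
    progress = (step - WARMUP_STEPS) / max(1, MAX_STEPS - WARMUP_STEPS)
    return LEARNING_RATE * max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


# Sequences consumed per optimizer step on one device.
effective_batch_per_device = PER_DEVICE_TRAIN_BATCH_SIZE * GRADIENT_ACCUMULATION_STEPS

if __name__ == "__main__":
    for step in (0, 500, 1000, 3000, 5000):
        print(f"step {step:>5}: lr = {lr_at(step):.6f}")
    print("effective batch per device:", effective_batch_per_device)
```

Under these assumptions the rate peaks at 0.0002 once warmup ends at step 1000, falls to half of that at step 3000, and reaches zero at step 5000. With 16 sequences per device and 4 accumulation steps, each optimizer step covers 64 sequences per device.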