Topic

Optimization

Optimizers, learning-rate schedules, and training dynamics.

2 checkpoints