Topic

Tensor Parallelism

Splitting individual layers across devices.

2 checkpoints