distributed-training 2 Notes on PyTorch's Distributed Data Parallel (DDP) May 13, 2025 Distributed training technologies for Transformers: Overview Aug 30, 2024