Design and Implementation of MPI Collective Operations for Large Message Communication on AMD GPUs C. Chen, L. Xu, H. Subramoni, D. Panda ISC HIGH PERFORMANCE 2025, Jun 2025.