High-Performance Network- and GPU-Aware Communication for MPI Partitioned and MPI Neighbourhoods