Avoiding AI/ML traffic congestion with global load balancing