Abstract

Existing approaches for managing congestion in an artificial intelligence (AI) data center (DC) network fabric typically involve dedicated hardware and are often reactive in nature. Proposed herein are techniques that can be utilized to avoid fabric congestion instead of building reactive techniques with new hardware. Specifically, a Segment Routing-Traffic Engineering (SR-TE) agent is proposed herein that can be utilized to avoid fabric congestion, which can enable enterprises to use their existing data centers for AI training.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS