Sunday, 15 September 2024

Artificial Intelligence for Network Engineers: Introduction

Several books on artificial intelligence (AI) and deep learning (DL) have been published over the past decade. However, I have yet to find a book that explains deep learning from a networking perspective while providing a solid introduction to DL. My goal is to fill this gap by writing a book titled AI for Network Engineers (note that the title name may change during the writing process). Writing about such a complex subject will take time, but I hope to complete and release it within a year.

Part I: Deep Learning and Deep Neural Networks

The first part of the book covers the theory behind Deep Learning. It begins by explaining the construct of a single artificial neuron and its functionality. Then, it explores various Deep Neural Network models, such as Feedforward Neural Networks (FNN), Convolutional Neural Networks (CNN), and Recurrent Neural Networks (RNN). Next, the first part discusses data and model parallelization strategies such as Data, Pipeline, and Tensor Parallelism, explaining how input data and/or model sizes that exceed the memory capacity of GPUs within a single server can be distributed across multiple GPU servers.

Part II: AI Data Center Networking - Lossless Ethernet

After a brief introduction of RoCEv2, the second part continues from part one by explaining how parallelization strategies affect network utilization. It then discusses the Data Center Quantized Congestion Notification (DCQCN) scheme for RoCEv2, introducing key concepts such as Explicit Congestion Notification (ECN) and Priority-based Flow Control (PFC). In addition to ECN and PFC, this section covers other congestion-avoidance methods, such as packet spraying and deep buffers. The second part also delves into AI data center design choices focusing on the East-West backend network. It introduces Rail, Top-of-Rack (ToR), and Rail-Optimized designs.



Figure 1:
Book Introduction.

7 comments:

  1. This comment has been removed by the author.

    ReplyDelete
    Replies
    1. Thank you Toni. Looking forward to read on this interesting subject from network engineers perspective. I have read your other books on complex technology and keen to read this too. Appreciate your work..

      Delete
    2. Thank you Sarfaraz for you supportive comment :)

      Delete
  2. Looking forward to this. Your vxlan/evpn books set the standard when it comes to understanding what’s under the hood and how things operate down to the packet level.

    ReplyDelete
  3. As a long time solutions engineer in LAN/WAN/Security/DC, etc., I've found myself slightly caught off guard by the explosion in AI and backend systems. Like you noted, I haven't seen anything like this in the market. I'm definitely excited about this publication! Thanks for continuing to push the bar!

    ReplyDelete
  4. looking at the 2 parts described, sounds more like 'Networking for AI' than 'AI for Network Engineers'

    ReplyDelete