当前位置:首页 > 报告详情

争论现状:可插拔线性光学器件与共封装光学器件.pdf

上传人: 明**** 编号:1011415 2025-12-21 16页 668.68KB

1、State of the Debate:Pluggable and Co-Packaged Optics Vlad Kozlov,LightCountingDemand for optics in AI ClustersSource:Optics for Cloud Data Centers July 2025()Transceivers and GPUs sold by NvidiaSource:Optics for Cloud Data Centers July 2025()Complex correlation between the numbers of GPUs and Transc

2、eivers:NICs,switch SerDes and more.-2.00 4.00 6.00 8.00 10.00 12.00202220232024202520262027202820292030Units(M)400G800G1.6T3.2T-1.00 2.00 3.00 4.00 5.00 6.00202220232024202520262027202820292030Units(M)HopperBlackwellBlackwell UltraRubinRubin UltraFeynmanSwitches for Scale-Out and Scale-Up-200.00 400

3、.00 600.00 800.00 1,000.00 1,200.00 1,400.00 1,600.0020232024202520262027202820292030SerDes lanes(M)50G100G200G400G-200.00 400.00 600.00 800.00 1,000.00 1,200.00 1,400.00 1,600.00 1,800.0020232024202520262027202820292030SerDes lanes(M)100G200G400GSource:Optics for Cloud Data Centers July 2025()Scale

4、-Out:Ethernet and InfiniBandScale-Out:NVLink and otherScale-Up First,Scale-Out LaterSource:Nvidia GTC 202540 x increase in performance!Mostly because of the larger number of GPUs in the scale-up domain:From 8GPUs in NVL8 to 72 GPUs(or 144 GPU dies)in NVL72Google and Nvidia architecturesBoth systems

5、deliver similar performance,but Googles scale up domain is 9,216 TPUs vs.72 GPUs of Nvidia,but Googles ICI between TPUs connectivity is not as“good”as NVLink(no memory sharing).Nvidias system requires almost 2.7x more transceivers(without using optics in scale-up networks).Googles Ironwood Pod compr

6、ises 9,216 TPUs with an aggregate 42.5 exaflops of FP8 performance.It requires 1.5 of 800G transceiver per TPUNvidia would need 118 of its Blackwell Ultra NVL72 racks,which would have 8,496 GPUs in total to reach 42.5 exaflops of FP8 performance.It requires 4 of 800G transceivers

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据《》标记中的内容,全文主要内容概括如下: 1. **AI集群光学需求增长**:预计到2030年,800G、1.6T和3.2T的传输器(TRXs)和AOCs需求将显著增长。 2. **GPU与传输器复杂关系**:GPU和传输器的数量之间存在复杂关联,例如Nvidia的Blackwell Ultra NVL72需要大量传输器。 3. **Google与Nvidia架构对比**:Google的Ironwood Pod包含9,216个TPU,而Nvidia的系统需要更多的传输器。 4. **Scale-Up与Scale-Out网络**:Scale-Up网络中,Nvidia的系统需要约2.7倍更多的传输器,而Google的系统在TPU连接性方面不如Nvidia的NVLink。 5. **光学互连技术**:光学互连在Scale-Up网络中越来越受欢迎,而铜互连在Scale-Out网络中更常见。 6. **CPO技术发展**:可插拔CPO和CPC(共封装铜)技术正在发展,预计将提高可靠性并简化设计。 7. **市场预测**:LPO/CPO在Scale-Up网络中的预测有所变化,预计将继续增长。
"AI集群光学需求激增" 未来数据中心的关键" AI集群互连哪家强?"
客服
商务合作
小程序
服务号
折叠