当前位置:首页 > 报告详情

人工智能_高性能计算闪电演讲.pdf

上传人: 明**** 编号:1011679 2025-12-21 39页 3.62MB

1、AI/HPC Lightning TalksWongyu Shin,Technical Leader of HardwareChip Design Strategy in AI Application EraGrowing Design ComplexityAdvanced Process Nodes:Increasing transistors(7nm 5nm 4nm 3nm)Beyond reticle:Multi-core Multi-chip module Multi-chiplet 3D LargePackage Size(120mm x 150mm)SLP(substrate-li

2、ke PCB)Beyond package:Scale-up(OAI UBB Backplane fabric within a Rack)Scale-out(Backplane fabric across Racks)Turnaround Time(TAT)PressureKey Challenges Today18 months per package(tape-out)2-3mm)Decoupling the Layers:Protocol vs.Adapter(Link)Strategic Directions for Logical&System-Level Interoperabi

3、lityAddressing Protocol Divergence with Semi-Standard&High-Utilization ProtocolsOptimizing Latency with 256B Latency-Optimized FlitOvercoming Bandwidth Discrepancy:Speed HarmonyBroader Interoperability:Protocols Across Different Physical LinksConclusion&The Collaborative Path ForwardBy clearing defi

4、ning the issues,we can strategically direct our efforts towards robust solutionsFostering broader interoperability by enabling protocols to span various physical links will unlock new levels of system design flexibility and scalabilityStrategic Imperatives&Community Co-working DirectionThank You!Shr

5、ikanth VenkateshappaHow Multi-Voltage Systems Reshape Cost,Efficiency&SustainabilityAI rack power is explodingTakeaway:The current low-voltage approach is inefficient,material-heavy,and unsustainable at AI-scale rack power.Modern AI compute racks will exceed 1 MW each,far above legacy design expecta

6、tions.At 48 V,delivering 1 MW requires 20,833 A,forcing 3 tonnes of copper per MW costly,heavy and difficult to install&operate151026601201401803606001000020040060080010002015201820202023202420252026202720282030Rack Power(kW)YearRack PowerRack PowerNvidia DGX-2(10 kW)Rubin(180 kW)Nvidia DGX B200 rac

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据报告的内容,全文主要围绕AI和高性能计算(HPC)领域的挑战和解决方案展开,包括: 1. **芯片设计复杂性增加**:随着先进工艺节点的发展(7nm→5nm→4nm→3nm),设计复杂性不断上升。 2. **芯片封装技术革新**:从单芯片到多芯片模块、芯片let,再到3D封装和大型封装(>120mm x 150mm),以及SLP(类似基板的PCB)。 3. **设计重用和芯片let革命**:强调设计重用的重要性,探讨芯片let(软硬、标准/高级封装)的互操作性挑战。 4. **多电压系统**:提出多电压系统在降低成本、提高效率和可持续性方面的优势,例如±1000V直流供电可减少铜用量和损耗。 5. **存储技术革新**:从DRAM为中心转向SSD意识,通过算法和数据结构协同设计提高IOPS效率。 6. **动态电源管理**:介绍镍锌电池在AI动态电源管理中的应用,提高响应速度和安全性。 7. **网络基础设施测试**:提出GENIE工具,用于模拟和测试网络配置对机器学习工作负载的影响。
挑战与革命" 电力重塑之路" 软件与硬件的协同"
客服
商务合作
小程序
服务号
折叠