当前位置:首页 > 报告详情

面向大规模人工智能的互连与连接:塑造开放式人工智能系统的互连与存储架构的未来.pdf

上传人: 明**** 编号:1012061 2025-12-21 20页 2.34MB

1、Siamak Tavallaei,Samsung SemiconductorBrian Hirano,XCENASamir Rajadnya,MicrosoftManoj Wadekar,MetaDurgesh SrivastavaInterconnects and Connectivity for At-scale AIA Panel DiscussionSERVER:COMPOSABLE MEMORY SYSTEMS(CMS)As data-intensive applications continue to scale,data-intensive workloads demanda f

2、resh look.In response,the computer industry is working on various memoryfabric-oriented proposals such as CXL,UALink,NVLink,and SUE for efficient data-movement in scale-up systems between xPUs,memory,and other end-devices.In this presentation,the panelists explore the latest trends in high-speedinte

3、rconnect starting from an internal,d2d interconnect such as UCIe toward andexternal interconnect such as photonics for at-scale AI/ML systems.They willdiscuss technical and business aspects of the trade-offs they are making for scale-up and scale-out systems.As part of the OCP/CMS effort,the panelis

4、ts will present their insights into howcomposable systems may help increase system efficiency to drive added value viadisaggregated compute,memory,and storage elements.AbstractBrian Hirano,XCENAAcademic Research and AI FabricsTRACK NAMEWhy speak with Academic Researchers?What academic research is go

5、ing on in“AI Fabrics”What ways to start engaging?What work could be done with academic researchers?AgendaPrompt:I would like to understand how next generation data center systems can use CPUs,GPUs,DPUs,CXL,PCIe,UltraEthernet,UALink,NVLink,memory and storage,so they run AI applications efficiently.Co

6、uld you draw me a diagram to show me how this can be done?Shah,A.,Chidambaram,V.,Cowan,M.,Maleki,S.,et al.,TACCL:Guiding Collective Algorithm Synthesis Using Communication Sketches,NSDI 2023.https:/research.google/blog/alpa-automated-model-parallel-deep-learning/Xinjun Yang,et,al.Unlocking the Poten

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据报告的内容,全文主要探讨了在数据密集型应用规模扩大的背景下,计算机行业如何通过高速互连技术提升系统效率。以下是关键点: 1. **高速互连技术**:包括CXL、UALink、NVLink和SUE等,用于在规模扩展系统中实现高效数据传输。 2. **AI/ML系统互连**:从内部互连(如UCIe)到外部互连(如光子学),探讨最新趋势。 3. **可组合系统**:通过解耦计算、内存和存储元素,提高系统效率。 4. **CXL与Fabric**:CXL适合内存扩展,但不足以支持AI规模互连;需要Fabric实现多对多拓扑和共享内存池。 5. **Scale-Up架构**:Fabric是Scale-Up架构的核心,支持大规模AI工作负载。 6. **内存扩展需求**:CPU和GPU平台因内存需求增长而需要额外的内存连接点。 7. **新兴互连技术**:PCIe和CXL已成熟,新兴互连技术解决PCIe/CXL的限制。
"AI数据中心架构揭秘" "CXL与新型互连技术对比" "如何构建高效AI工作负载架构?"
客服
商务合作
小程序
服务号
折叠