当前位置:首页 > 报告详情

CXL-PNM 研究历程:架构演进与软件环境.pdf

上传人: 明**** 编号:1011606 2025-12-21 26页 2.53MB

1、Kwangsik Shin()The Journey of CXL-PNM Research:Architecture Evolution and Software EnvironmentThe Journey of CXL-PNM Research:Architecture Evolution and Software EnvironmentKwangsik Shin()Compute scaled exponentially,memory performance failed to keep upExplosive growth of AI&data is outpacing tradit

2、ional computer architectureMemory wall:Data movement,not compute,is the bottleneckThe Data-Centric BottleneckGoogle,“Whats Next for the Foundations of AI”,AI Infra Summit.Sep.10,2025PNM(Processing-Near-Memory)Design Principles:Minimize data movement(bandwidth wall)Leverage memory parallelism(multi-c

3、hannel)Maximize memory bandwidth utilization(streaming)Match compute to memory access patterns(sequential)CXL-Attached Approaches Compute behind CXL endpointsMemory capacity expansionPNM Explained1231PNMDDR HostDDR 123Three Architectural Approaches with Trade-offsComparison TableCXL-PNM Architecture

4、 TaxonomyArchitectureImplementationPerformanceFlexibilityDev RiskLogic-onlyFixedCore-onlyARM/RISC-V onlyHybridFixed+ARM CoresLogic-only prototype(Initial CMM-Ax)Demonstrated huge speedups and energy savings by co-locating compute and data.LimitationsOnly supported a narrow set of pre-defined operato

5、rs;no control for new algorithms.Hybrid design(Current CMM-Ax)Adds ARM cores to run a device-side runtime with flexible microcode,plus streaming data-paths.CMM-Ax Architecture:From Logic-Only to Hybrid Programmable offloads without sacrificing DRAM burst efficiencyCXL EndpointcDeviceMemoryDDRDDRDDRD

6、DRMCMCMCMCInterconnectPFL IPsVA-PATrans.armMACACCCMPData PathControl PathPNM Engine.mem.io&CommandData ,Command Software Platform,Ready for Service-Level EvaluationThree-Layer Software Architecture1.Application LayerFAISS AdaptorVector Search Lib.2.Programming ModelUser Defined Operations3.System In

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据《The Journey of CXL-PNM Research: Architecture Evolution and Software Environment》的内容,以下是全文关键点的概括: 1. **数据密集型瓶颈**:传统计算机架构在处理爆炸式增长的AI和数据时,内存性能成为瓶颈,数据移动而非计算成为限制因素。 2. **PNM设计原则**:减少数据移动、利用内存并行性、最大化内存带宽利用、匹配计算与内存访问模式。 3. **CXL-PNM架构**:包括逻辑仅用、核心仅用和混合设计,实现计算与数据近内存处理。 4. **CMM-Ax架构**:从逻辑仅用原型发展到混合设计,支持灵活的微代码和流数据路径。 5. **软件平台**:提供应用层、编程模型和系统集成,支持FAISS集成和Kubernetes设备插件。 6. **性能提升**:与CPU+DIMM相比,CMM-DDR5提供4.54倍吞吐量和5.56倍能效。 7. **实际部署**:在Kubernetes集群中实现50%服务器减少,同时保持90%的性能。 8. **未来方向**:通过ASIC原型验证商业价值,探索与云提供商的合作。
"CXL-PNM如何突破内存瓶颈?" 近内存计算的未来?" AI时代的性能革命?"
客服
商务合作
小程序
服务号
折叠