当前位置:首页 > 报告详情

迈向GPU驱动存储新时代的技术路径.pdf

上传人: 明**** 编号:1011959 2025-12-21 23页 2.12MB

1、1Technical paths to the new era of GPU-initiated storageCJ Newburn,Distinguished Engineer,NVIDIA GPU CloudVikram Mailthody,Senior Researcher,NVIDIA ResearchSTORAGE22App taxonomy:bottlenecks,dynamism,granularityDesigning storage solutions hinges on understanding app requirementsTraditional focus for

2、GPUs has been compute-intensive apps like trainingThe explosion of innovation for VecDB,predictive AI drives new technologies to fill the gapsAll appsCompute-intensiveData-intensiveCPU-initiatedpredictable,coarse-grainedGPU-initiateddynamic,fine-grainedBottleneckStorage accessVecDB search/index,pred

3、ictive AI,relational graphsInference model load,inference KV$,multi-modal small-model trainingLLM training333Styles of compute node storage interactionCompute-intensive appsLLM training/inferenceGenerative AIWorking sets fit in memoryLow bandwidth,not perf criticalcuFile/cuObject,POSIX,S3Coarse grai

4、ned,standard NVMeNAND:large pages,low IOPsTB/TCOData-intensive appsGNNs,vector DB,relational graphsPredictive AI,searchWorking sets spill to storageHigh bandwidth,perf criticalSCADAFine grained,customize front endNAND:many dies*planes,BCH,high IOPsIOPS/TCOCPU,file/objectGPU with cache,item44Getting

5、ahead of the trendAnticipating the needs of emerging usage models while sustaining core volume for legacyBreakout out of artificial memory constraintsBut then storage needs to keep up with memorys sparse IOPsCategoryDisruptive trendWhat we want to doSize wrt memory10TB 1 PBAccess storage like memory

6、 with an APIConcurrencyO(100)/CPU O(100K)/GPUthread accessesNew programming modelaccesses from the GPUAccess patternSparse,random on vector dataNew storage SKUs optimized for sparse IOPs/TCOGranularityCoarse fineProgramming modelGPU autonomyGPU-initiated,fine-grained(SCADA)DisaggregationCant fit TB

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据报告的内容,全文主要探讨了GPU驱动的存储新时代,以下为关键点: 1. **应用分类**:将应用分为计算密集型和数据密集型,并区分CPU和GPU驱动的存储访问模式。 2. **存储挑战**:存储访问成为瓶颈,特别是对于数据密集型应用。 3. **计算节点与存储交互**:提出三种交互风格,针对不同应用需求。 4. **突破内存限制**:通过扩展存储容量和优化访问模式,突破内存限制。 5. **新兴工作负载**:随着数据集和内存需求增长,需要新的存储解决方案。 6. **并发性和关键性**:数据密集型应用需要高并发存储IO,GPU比CPU更有效。 7. **性能与成本**:平衡IOPS/TCO和TB/$,优化存储性能和成本。 8. **GPU自主性和协作**:GPU在存储访问中发挥自主作用,同时与CPU协作。 9. **解耦**:通过解耦计算和存储,提高效率和可扩展性。 10. **未来方向**:探索新的存储架构和优化技术,如SCADA和Storage-Next。
挑战与机遇" GPU引领的未来" GPU如何改变游戏规则?"
客服
商务合作
小程序
服务号
折叠