当前位置:首页 > 报告详情

近GPU存储:加速存储以扩展AI工作负载的要求.pdf

上传人: 明**** 编号:1011999 2025-12-21 16页 1.16MB

1、Ta-Yu Wu,MetaEhsan K.Ardestani,MetaNear-GPU Storage:Requirements for Accelerating Storage to Scale AI WorkloadsNear-GPU Storage:Requirements for Accelerating Storage to Scale AI WorkloadsTa-Yu Wu,MetaEhsan K.Ardestani,MetaOCP SPECIAL FOCUS:ARTIFICIAL INTELLIGENCE(AI)Enabling Efficient AI Scaling wit

2、h StorageGPU PerfGrowthNetworkingLarge ClusterPowerBudgetsEnabling Efficient AI Scaling with StorageAI Deployments TodayGPU ServerGPUMemoryScaling AI Deployments for the FutureGPU ServerGPUMemoryStorageModelGrowthRapid SWChangeDataQualityAI Development Contributing FactorsHardware-Related Contributi

3、ng FactorsFocus Areas for GPU Initiated Storage Workloads01020304Performance EfficiencyMeasure performance efficiency in IOPS per dollar/watt in additional to TB per dollar/wattBalance in PerformanceDevices should have high sustained and peak performance for both reads and writesDevice FlexibilityFl

4、exible device architecture allowing different capacity and performance profiles to be deployed at customers site enables rapid alignment with latest workload iteration.Usable IOPSFocus on IOPS vs.Latency as a key performance metricNear GPU Storage Focus AreasNear-GPU Storage:A New Landing Zone for G

5、PU Initiated WorkloadsMemoryNear-GPU StorageCompute StorageWarm StorageCold Storage-Capacity per watt/dollar focus-IO Size are 4K-Topology/Technology:TLC Flash locally attached-New storage tier focused on performance per watt/dollar-IO Sizes are between 512B-4K-Capacity per watt/dollar focus-IO Size

6、 are 4K to MBs-Topology/Technology:TLC/QLC/HDD remote access-Capacity per watt/dollar focus-Large block access-Topology/Technology:HDD remote access-Performance per watt/dollar focus-Word size transfers(=64B)-Topology/Technology:DDR locally attachedIOPS per Watt|DollarCapacity per Watt|DollarAI Appl

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据文章内容,全文主要探讨了近GPU存储在加速AI工作负载中的需求。以下是关键点: 1. **性能效率**:关注IOPS/美元/瓦特,以及TB/美元/瓦特。 2. **存储类型**:包括内存、近GPU存储、计算存储、暖存储和冷存储。 3. **AI应用需求**:针对GenAI/LLMs、训练和推理阶段提出存储建议。 4. **性能指标**:强调IOPS和低延迟。 5. **设备灵活性**:支持不同容量和性能配置。 6. **容量与性能平衡**:关注性能/容量灵活性。 7. **AI应用挑战**:如快速软件变化和硬件长周期。 8. **设备制造商策略**:产品规划、控制器设计、写性能提升、灵活性和创新。 9. **用户参与**:分享问题和工作负载,参与开放协作。
如何提升效率?" 未来AI加速关键?" IOPS与功耗平衡?"
客服
商务合作
小程序
服务号
折叠