当前位置:首页 > 报告详情

LANL 计算存储:下推的故事.pdf

上传人: 明**** 编号:1011589 2025-12-21 14页 2.71MB

1、Gary GriderLA-UR-_LANL Computational Storage:The Pushdown StoriesSimulations:100k-nM cores,state 100TB-1PB per time step,state 5-100 variables(distributed arrays(columnar)and distributed structs(row)-64bit float.Simulations are 1-10k time steps typicallyCurrent state:read 100tb/1pb time steps or som

2、e subset and let the supercomputer find the parts we wantDealing with this kind of gravity is hard/expensive soNeed to compress but its high entropy sparse floatsNeed to protect but N copies is not feasibleNeed to index/characterize but its too big to sortWant to use standards/standard facing soluti

3、ons,no vendor lock-in,flexible solution so we can track economic changes Supercomputer idle while checkpointing but has little free memory:there is a compute resource available while writing stateDesire to run analytics on smaller machines than simulation,we dont tie up simulation resources for anal

4、ysisLess power,faster time to insight,BackgroundComputational Storage What and Why?Data GravityData Agnostic OffloadsServer memory BW does not allow many passes over streaming dataData Aware OffloadsAnalytics is often multiple orders of magnitude less reading than writingYou just have a hard time fi

5、nding what you are looking for(filter/index/histogram/etc.)Can we add metadata/indexing/ordering to data as it is written with almost no overhead and reap huge wins on read(time,hdwr resources,energy)For ScienceParticle methods-“Ordered”row-based analytics(KV)Grid methods-columnar-based analytics Pu

6、shdown what mostly reduction or nearest neighborCompression/Encoding/erasure(Eideticom,Maxlinear,Intel,Aeon)Block list(Seagate Kinetic and ZFS(library to get block lists from file)KV range(SK hynix KVCSD)Object list(SK hynix OBCSD,Versity,AirMettle,NeuroBlade)

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据标记内容,全文主要围绕LANL(Los Alamos National Laboratory)的计算机存储技术展开,强调解决大规模数据处理的挑战。以下是关键点: 1. **大规模模拟**:模拟涉及100k-nM核心,每步状态100TB-1PB,处理5-100个变量(分布式数组和结构)。 2. **数据处理挑战**:需要压缩高熵稀疏浮点数,保护数据,索引数据,但数据量巨大。 3. **计算存储**:利用计算存储解决数据重力问题,如数据无关和相关的卸载。 4. **数据分析和可视化**:通过近存储的列式分析进行科学可视化,减少资源占用,加快洞察力。 5. **技术实现**:使用压缩/编码/纠错技术,块列表,键值范围,对象列表等。 6. **AI应用**:通过GUFI布局和模型上下文协议,实现数据保护和AI应用。
解决方案何在?" 如何实现高效分析?" 数据分析新视角?"
客服
商务合作
小程序
服务号
折叠