当前位置:首页 > 报告详情

对高性能计算 (HPC) 应用进行屋顶线分析和剖面分析以指导系统设计.pdf

上传人: 明**** 编号:1011819 2025-12-21 14页 2.05MB

1、George Michelogiannakis(miheloglbl.gov)Patricia Gonzalez-Guerrero(lg4erlbl.gov)HPC and AI chiplet modularity workstream co-leadsRoofline Analysis and Profiling of HPC Applications to Guide Modular System DesignSERVER:OPEN CHIPLETECONOMYSpecialization Sustains Performance ScalingSpecialization Requir

2、es InvestmentsChiplet Modularity Makes Specialization AccessibleChiplets MarketplaceProfiling helps us pick a collection of chiplets(newly-developed or available from the community)for a set of applications,as well as their placementApplication Profiling Guides Chiplet DesignApplication characteriza

3、tionCollection of chiplets that maximize metric of interestOCPs chiplet modularity workstream members collected profiling results and performed literature searchWell present some insight from profiling efforts(the topic of this talk)You can join us to help shape this effort!Particle Tracking(ATLAS 2

4、6%in Perlmutter)Roofline Graph34GB memory occupancyAverage IPC 241%of instructions areloads or storesParallel and Serial RegionsLoad/Store HistogramDRAM Access LatencySimulated cache-aware annealing to optimize routing cost of a chip designSA approximates the global optimum in a large search spaceCa

5、nneal(PARSEC Benchmark Suite)Memory bandwidth(GB/s)21%of stalls due to memoryPower timeline.mJ/secProfiling AI AlgorithmsWe use these results to design a systolic array with chiplets.Join our workstream for more!Generate hardware description(models or RTL)or models for HPC and AI appsMap onto existi

6、ng or projected technological advancements in chiplets and packagingAlso,memory and I/O chiplets and protocolsDevelop a systematic mapping method for a set of chiplets in tiles and onto a 2D planeOngoing WorkPlease join us in shaping strate

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据标记内容,全文主要内容概括如下: - **HPC和AI芯片模块化工作流**:George Michelogiannakis和Patricia Gonzalez-Guerrero是该工作流的共同负责人。 - **Roofline分析和性能指导**:通过Roofline分析来指导模块化系统设计,提高性能。 - **芯片模块化与市场**:芯片模块化使专业化设计更易实现,并促进市场发展。 - **芯片模块化应用**:通过应用分析指导芯片设计,选择和放置芯片模块。 - **性能分析**:收集了性能分析结果,包括粒子追踪、内存占用、IPC、负载/存储操作等。 - **AI算法分析**:使用分析结果设计基于芯片模块的 systolic array。 - **工作流成员**:OCP的芯片模块化工作流成员进行了文献搜索和性能分析。 - **持续工作**:生成硬件描述、映射到芯片模块和封装技术、开发芯片模块映射方法。 - **号召行动**:邀请加入工作流,共同塑造使用芯片模块化策略。 核心数据: - 粒子追踪(ATLAS 26%在Perlmutter上) - 34GB内存占用 - 平均IPC 241% - 21%的停顿由于内存 - 21%的 stalls due to memory - 21% of stalls due to memory 关键点: - 芯片模块化在HPC和AI中的应用 - 性能分析和Roofline分析指导设计 - 系统atic array设计 - 文献搜索和性能分析结果 - 芯片模块化工作流的持续发展
HPC与AI的未来?" 揭秘芯片性能极限!" 芯片模组化新篇章!"
客服
商务合作
小程序
服务号
折叠