Architecture Analysis for ETL Processing: CPU vs GPU.pdf

No. 167558, PDF, 24 pages, 1.12 MB

Architecture Analysis for ETL Processing: CPU vs GPU
Nikolay Sakharnykh, Senior AI Developer Technology Manager
Jason Lowe, Distinguished System Software Engineer
Data+AI Summit 2024

Agenda
- Performance limiters of Database Operations
- CPU and GPU architectures overview
- GPU performance on join and full SQL Queries
- RAPIDS Accelerator for Apache Spark
- RAPIDS Accelerator Benchmarks

Can GPUs accelerate ETL?

Database Operations and Their Performance Limiters
- Sequential access to input/output tables (read/write)
- Random fine-grained access to hash tables (CAS/read)
- Integer computations for hashing
[Diagram: hash join build and hash join probe over Key/Payload tables (k1, p1) to (k4, p4) and a hash table; arrows labeled compare and swap, read, write]

CPU and GPU Architectures
[Diagram: GPU (SMs, L2, XBAR, HBM, HUB, PCIe I/O, NVLINK) connected to the CPU over the PCIe bus]

CPU and GPU Architectures: Memory Speeds
Bandwidth is calculated as the size of access multiplied by the number of accesses, divided by time.

                        Intel Xeon Platinum 8470Q    NVIDIA H100-96GB
                        (DDR5, 8 channels)           (HBM3)
Peak                    307 GB/s                     4023 GB/s
Achieved, sequential    238 GB/s                     3638 GB/s
Achieved, 8B random     24 GB/s                      306 GB/s

PCIe Gen5: 128 GB/s (bi-dir)

Grace Hopper Superchip
- Hopper GPU: HBM3, 96 GB
- Grace CPU: LPDDR5X, 480 GB
- NVLINK C2C: 900 GB/s
- 18x NVLINK 4: 900 GB/s
- 4x 16x PCIe-5: 512 GB/s
- NVLINK network: 256 GPUs
- High-speed I/O
- Hardware Consistency
https:/

Micro-benchmark
- Grace Hopper achieved performance is 8-10x faster than projected best x86 performance
- Performance limiter: random 64-bit CAS/read

NVIDIA Decision Support-H Benchmark
NVIDIA Decision Support-H (NDS-H) is our adaptation of the TPC-H benchmark often used by database customers
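
The hash join build and probe phases named in the slides can be illustrated with a minimal single-threaded Python sketch. It assumes open addressing with linear probing; the function names, the multiplicative hash constant and the table capacity are illustrative assumptions and do not come from the deck. On a GPU, claiming a slot in the build phase would be an atomic compare-and-swap; here it is a plain check because nothing runs concurrently.

```python
EMPTY = None  # marker for an unused hash table slot


def hash_key(key, capacity):
    # Integer computation for hashing (the constant is an illustrative choice).
    return ((key * 2654435761) & 0xFFFFFFFF) % capacity


def build(keys, payloads, capacity):
    # Capacity must exceed the row count so that probing always finds an empty slot.
    assert capacity > len(keys)
    table = [EMPTY] * capacity
    for k, p in zip(keys, payloads):        # sequential read of the build table
        slot = hash_key(k, capacity)
        while table[slot] is not EMPTY:     # random access; a GPU would CAS here
            slot = (slot + 1) % capacity
        table[slot] = (k, p)
    return table


def probe(table, keys, payloads):
    capacity = len(table)
    out = []
    for k, p in zip(keys, payloads):        # sequential read of the probe table
        slot = hash_key(k, capacity)
        while table[slot] is not EMPTY:     # random fine-grained reads
            build_key, build_payload = table[slot]
            if build_key == k:
                out.append((k, build_payload, p))  # sequential write of the result
            slot = (slot + 1) % capacity
    return out


table = build([1, 2, 3, 4], ["p1", "p2", "p3", "p4"], capacity=8)
print(probe(table, [2, 4, 5], ["a", "b", "c"]))
```

The three access patterns from the slide appear directly: both loops stream over their input rows, every table lookup lands on a slot chosen by the hash, and the hash itself is pure integer arithmetic.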
