当前位置:首页 > 报告详情

MangoBoost 全栈 AI 基础设施解决方案:MLPerf 推理、训练、存储案例研究.pdf

上传人: 明**** 编号:1011622 2025-12-21 26页 6.52MB

1、MangoBoostOCP Global Summit 2025MangoBoost Full-Stack AI Infrastructure Solutions MLPerf Inference,Training,Storage Case StudiesWebsiteContactwww.mangoboost.iocontactmangoboost.ioEriko Nurvitadhi,PhD,MBAChief Product Officer&Co-Founder 2025 MangoBoost,Inc.All rights reserved.Do Not distribute withou

2、t permission.2AI Era and Key BottlenecksPART I.2025 MangoBoost,Inc.All rights reserved.Do Not distribute without permission.The AI Era Calls for Extreme HW/SW Engineeringneeds more interconnectionsGenerative AI Boom“DeepSeeks modular,distributed AI training will likely drive demand for efficient net

3、working solutions”-Source:ForbesGPU clusterGPU serverGPU serverGPU serverIntra-server networkIntra-server NetworkInter-server network(through external switches)3 2025 MangoBoost,Inc.All rights reserved.Do Not distribute without permission.4Complicated AI SW StackApplication APIs(Python,REST,OpenAI,W

4、ebSocket,etc.)AI MicroservicesDeployment(Inference Serving)Data Analytics(RAG,etc.)Fine-tuning(LoRA,etc.)TrainingInfrastructure Management(Kubernetes,admin UI,monitoring,etc.)Multi-Node Scaling&Optimized Kernels(collective communication,auto-scaling,etc.)Problem#1:Extremely Difficult Software Engine

5、ering 2025 MangoBoost,Inc.All rights reserved.Do Not distribute without permission.Problem#2:Extremely Difficult Hardware(Network)EngineeringScaling of Peak Hardware FLOPS and Interconnect BandwidthIncreasing Inter-node communications Source:AI and Memory Wall“AI applications are bottlenecked by com

6、munication across/to AI accelerators,rather than compute.”-Amir Gholami,International Computer Science Institute,UC Berkeley 5 2025 MangoBoost,Inc.All rights reserved.Do Not distribute without permission.Traditional Data Center:CPU runs heavy data center I/O tasksCPU runs heavy data center I/O tasks

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据报告的内容,全文主要内容概括如下: 1. **AI时代挑战**:AI时代需要极端的软硬件工程,面临软件和硬件(网络)工程的难题,以及快速、智能和灵活的网络加速器需求。 2. **MangoBoost解决方案**:提供全面的AI基础设施解决方案,包括GPU服务器、MB-DPU、远程存储和高速网络连接,显著降低总拥有成本(TCO)。 3. **MangoLLMBoost软件**:自动优化配置、扩展、调度和并行化,易于使用,支持多种基础模型。 4. **Mango GPUBoost™**:优化GPU数据传输,提供可编程拥塞控制,兼容标准RoCEv2,无需特殊交换机。 5. **Mango BoostX存储解决方案**:提供端到端存储解决方案,实现零CPU消耗的I/O路径,支持NVMe虚拟化。 6. **MLPerf结果**:在MLPerf Inference和Training中展示出卓越的性能和效率,在MLPerf Storage中提供行业领先的性能。 7. **性能对比**:与竞争对手相比,MangoBoost在成本效率和性能方面具有优势。
AI时代加速器揭秘" MangoBoost性能如何?" MangoBoost解决方案详解"
客服
商务合作
小程序
服务号
折叠