当前位置:首页 > 报告详情

加速边缘推理:利用基于 DWDM 的 IP 技术为 AI 时代解锁可扩展、安全且低延迟的连接.pdf

上传人: 明**** 编号:1011799 2025-12-21 14页 1.03MB

1、Frank YangAccelerating Inference at the Edge:Unlock Scalable,Secure,and Low-Latency Connectivity for the AI Era with IP over DWDMAccelerating Inference at the Edge:Unlock Scalable,Secure,and Low-Latency Connectivity for the AI Era with IP over DWDMFrank YangEdgeNearly 95%of AI spend is on inference(

2、run-time)vs.pre-training,according to Menlo Ventures enterprise AI survey,2024 79%of financial services companies follow a distributed data approach,according to Digital Realtys AI in financial services survey,202477%of the respondents believe inferencing at the Edge to be an essential strategy for

3、managing AI workloads effectively,according to Datacenter Dynamics edge AI survey,2025.Some Interesting Survey ResultsAI Inference WorkflowAI Framework:TensorFlow,PyTorch,TensorRT,etc.Model OptimizationInference ServingFar EdgeAI ApplicationData SourcesAI InfrastructureTrained ModelsModel Repository

4、Edge Data CenterCore Data Center/CloudAI Inference Use Case ExplorationsUse CaseEdge Workload Partial inference Core Workload Final inferenceKey Network RequirementsLatency and BandwidthSmart Retail/Mall AnalyticsShopper detection,anonymized embeddings,foot traffic heatmapsCross-store profiling,camp

5、aign analytics,personalizationModerate bandwidth(cameraedge),low local latency(100 ms),privacy/anonymizationHealthcare Imaging/DiagnosticsDICOM ingestion,image preprocessing,candidate lesion detectionMulti-modal correlation,ensemble diagnosis,access to historical records/EMRHigh bandwidth(images),lo

6、wish latency for triage(500 ms),strict security/complianceReal-Time Financial Fraud DetectionTransaction scoring,feature extraction,early anomaly flagging at branch/POSGraph correlation,cross-bank analysis,escalation and identity verificationUltra-low latency(50 ms),extremely reliable and auditable

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据《Data》标记中的内容,全文主要内容概括如下: - **AI推理在边缘计算中的重要性**:AI推理在AI支出中占比高达95%,边缘推理被视为管理AI工作负载的关键策略。 - **边缘AI推理的关键网络需求**:低延迟和高带宽是边缘AI推理的关键网络需求。 - **传统架构的挑战**:传统架构存在延迟、成本高、可扩展性差等问题。 - **IP over DWDM的优势**:IP over DWDM技术降低延迟、简化架构、降低成本,并支持可扩展部署。 - **技术组件**:包括100G/400G/800G相干模块、被动复用/解复用器、加密和网络自动化工具。 - **解决方案**:IP over DWDM在边缘-核心-云编排中的应用,以及降低延迟和复杂性的优势。 - **开放性和可扩展性**:IPoDWDM与OCP原则一致,支持开放、解耦、高效和可扩展的网络。 - **生态系统构建**:呼吁共同推动边缘AI传输的开放性和可扩展性。
IPoDWDM如何助力?" "AI时代,如何降低边缘推理延迟?" 打造开放边缘AI网络新方案?"
客服
商务合作
小程序
服务号
折叠