当前位置:首页 > 报告详情

设备本地诊断.pdf

上传人: 明**** 编号:1011985 2025-12-21 29页 1.90MB

1、Shashank Neelam,GoogleGreg Boudreau,CiscoDevice Local DiagnosisDevice Local DiagnosisShashank Neelam,GoogleGreg Boudreau,CiscoSONiC Workshop1.Overview2.Vendor Defined Rules/Schema3.NOS On-Device Monitor(SONiC)4.OpenConfig TelemetryCurrent State of Health Monitoring on the SwitchChallengeDescriptionL

2、imited Remediation&SpecificityCurrent monitoring for common FRUs(e.g.,PSUs,fans via SONiC psud,thermalctld),often lacks automated remediation steps and fails to monitor for specific,nuanced hardware failure scenarios.Inflexible Signal HandlingCurrent systems are often unable to interpret unique sign

3、als or anomalous system behaviors from components,limiting proactive fault detection.Over-reliance on Log ParsingCritical system health reporting is heavily dependent on parsing logs,which is a reactive,rather than proactive,approach to identifying failures.Incomplete Component CoverageMonitoring is

4、 frequently missing for smaller,less common,but still critical,components on a device,creating blind spots in overall system health.Issues for Network OperatorsA generic framework for hardware vendors to define discrete rules for monitoring and managing hardware health and failures respectively with

5、 the following characteristics:Supports granularity and flexibility to cover a wide range of sourcesDesigned to be structured for supporting multiple underlying HW platforms,SW versions,and HW versionsGeneric inputs and outputs between system and operator regardless of underlying software.What is de

6、vice local diagnosis?StandardizedWhy use device local diagnosis?Rapid Failure ResponseMinimizes lag time between detection of failure and beginning of recovery logic(both locally and from remote sources)Vendor Defined Device IntelligenceFully defined by HW vendor w/best insight into underlying behav

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据报告的内容,全文主要内容概括如下: 1. **当前健康监控挑战**: - 缺乏自动修复步骤和特定硬件故障场景的监控。 - 信号处理不灵活,难以解释独特信号或异常行为。 - 过度依赖日志解析,缺乏主动性。 - 组件覆盖不完整,存在监控盲点。 2. **设备本地诊断(Device Local Diagnosis)**: - 标准化框架,支持硬件厂商定义监控和管理规则。 - 提高故障响应速度,减少故障检测与恢复之间的延迟。 - 提供设备厂商定义的本地智能。 - 使用标准化框架和故障遥测,简化操作。 3. **规则和组件**: - 规则定义故障源、评估逻辑和修复步骤。 - 数据源扩展(DSE)用于简化规则,允许从不同来源获取信息。 4. **目标**: - 为网络运营商提供通用框架,支持多种硬件平台和软件版本。 - 减少网络运营商的复杂性和部署难度。
如何快速响应故障?" 简化规则定义的秘诀?" 网络运营商的利器?"
客服
商务合作
小程序
服务号
折叠