通过研究支持的指标降低 LLM 幻觉风险.pdf

编号:167731 PDF 42页 5.33MB 下载积分:VIP专享
下载报告请您先登录!

通过研究支持的指标降低 LLM 幻觉风险.pdf

1、2024 Databricks Inc.All rights reserved1Vikram ChatterjiVikram ChatterjiJune 11,2024June 11,2024Mitigating LLM Hallucination RiskHallucination RiskThrough Research Backed Metrics2024 Databricks Inc.All rights reserved2024 Databricks Inc.All rights reserved2NLP at scaleBottleneck:Bottleneck:Input/Out

2、put Evaluations cost millions$and took months.AI Evaluations at AI Evaluations at Scale.Scale.Powered by research-backed metricsFocus for today:Focus for today:As NLP has transitioned to GenAI,what does this mean for Evaluations of these new AI Systems?We will discuss 2 new methods for high accuracy

3、 metrics.2024 Databricks Inc.All rights reserved2024 Databricks Inc.All rights reserved3NonNon-deterministic deterministic nature of LLMs nature of LLMs 2024 Databricks Inc.All rights reserved4“LLMs are dream machines”“LLMs are dream machines”2024 Databricks Inc.All rights reserved5“DreamsDreams”:fe

4、ature or bug?:feature or bug?2024 Databricks Inc.All rights reserved6We are in the Era of NonWe are in the Era of Non-Deterministic Software.Deterministic Software.=New crop of concerns=New crop of concerns for Enterprise AI for Enterprise AI 2024 Databricks Inc.All rights reserved7McKinsey State of

5、 AI Report 20242024 Databricks Inc.All rights reserved2024 Databricks Inc.All rights reserved8How How AI Teams AI Teams Detect/Evaluate Detect/Evaluate HallucinationsHallucinationsToday.Today.Quantifying LLM HallucinationsQuantifying LLM HallucinationsN N-Gram Matching Gram Matching Ask GPT Ask GPT

6、There are 3 TechniquesThere are 3 TechniquesHuman EvaluationHuman Evaluation123BLEU|ROUGEBLEU|ROUGE-N N Compare to one or more reference completions.A score between zero and one indicating similarity to the reference,one indicating a perfect matchMETEORMETEOR Consider synonym,stemming and word order

友情提示

1、下载报告失败解决办法
2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
4、本站报告下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。

本文(通过研究支持的指标降低 LLM 幻觉风险.pdf)为本站 (张5G) 主动上传,三个皮匠报告文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知三个皮匠报告文库(点击联系客服),我们立即给予删除!

温馨提示:如果因为网速或其他原因下载失败请重新下载,重复下载不扣分。
客服
商务合作
小程序
服务号
折叠