当前位置:首页 > 报告详情

时尚前卫安全至上:大规模支持新一代人工智能.pdf

上传人: 可*** 编号:991871 2025-12-07 21页 1.81MB

1、Fashion Forward,Security First:Supporting Gen AI at ScaleFlorence MottayCISO ZalandoStand up if youve worked on an AI system Remain standingif youve ever worked on a red team or security assessment.Stand up if youve worked on an AI system Remain standingif youve ever worked on a red team or security

2、 assessment.Remain standingif youve conducted prompt injection testing or similar techniques.Stand up if youve worked on an AI system Fashion-forward security firstWhere itall startedChatGPT-poweredZalando assistant“With our Zalando Assistant,we can help customers find what to wear for a certain occ

3、asion-a birthday party,a business meeting or even hiking to Machu Picchu.Customers can get inspired by a certain style,celebrity,or cultural moment the possibilities are almost endless.”SecurityassessmentThe risks we faced:PrivacySecurityBut alsoBiasesInappropriate contentMisinformation,hallucinatio

4、nand robustness issuesa new world!UserExternalresourcesPersistent storageLLM(e.g.ChatGPT)AdversaryApp(e.g.semantic)Output(e.g.candidates)APIExternalThird-partyZalandoOutputsIndirect Prompt InjectionPrompt InjectionLLM may have access to external sources(e.g.web or DBs)LLM may have the capability to

5、write to some persistent storageThreat modelling123SecurityassessmentA few examplesWill ZA fabricate information regarding Terms and Conditions,refund policy,shipping,.at Zalando?Will ZA provide the same outcome for all genders,all backgrounds of customers?Is ZA susceptible to jailbreak attacks?Reme

6、diationtimeFine tuningFine tuned the model with classifier training80K prompts Today,every customer message is being parsed by our safety classifier as well as the OpenAI Moderation API Fashion-forward security firstHow it evolvedTwo main pillarsAI threat modelingAI red teaming Framework HighlightsC

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据报告的内容,全文主要内容概括如下: - **Zalando AI助手**:Zalando开发了ChatGPT驱动的AI助手,帮助顾客根据场合选择服装。 - **安全挑战**:AI系统面临隐私、安全、偏见、不适当内容和误导性信息等风险。 - **安全措施**:Zalando实施了AI威胁建模和AI红队测试,以识别和缓解风险。 - **AI威胁建模**:通过定向提示和红队数据集来识别AI风险。 - **AI红队测试**:使用定向提示和突变提示攻击LLM,以发现潜在漏洞。 - **成果**:通过模型微调和安全分类器,提高了系统的安全性。 - **未来展望**:Zalando持续改进其AI安全框架,确保业务和网络安全紧密合作。
揭秘Zalando的应对之道" AI安全如何“时尚”升级?" Zalando如何防范AI系统漏洞?"
客服
商务合作
小程序
服务号
折叠