Scaling Identity Graph Ingestion to 1 Million Events per Second Using Spark Streaming and Delta Lake.pdf

PDF, 43 pages, 2.07 MB

Scaling Identity Graph Ingestion to 1M events/sec using Spark Structured Streaming & Delta Lake
Akanksha Nagpal, Jianmei Ye
2021 Adobe. All Rights Reserved. Adobe Confidential.

Introduction
- Akanksha Nagpal, Sr. Software Engineer, Adobe (LinkedIn: akanksha_nagpal)
- Jianmei Ye, Sr. Software Engineer, Adobe (LinkedIn: jianmeiye)

Agenda
- Overview
- Adobe Identity Graph
- Journey to 10x
- Scaling & Optimization techniques
- Privacy Compliance Strategies
- Custom deployment workflows
- Lessons Learned & Takeaways
- Q/A

Overview

Adobe Experience Platform: Real-time CDP
- Unified Customer Profiles
- Actionable Audiences
- Activation
- Personalization at scale

Adobe Identity Graph

What is Identity Graph?
- Unifies fragmented identifiers (e.g., emails, device IDs, cookies) into a single view
- Enables consistent consumer recognition across channels and devices

Identity Service
- Petabytes of data processed daily
- 1M records/sec
- 50+ billion identities
- 70B+ records/day

These capabilities are built from the ground up to link disconnected identities into a single, unified profile to deliver consistent, connected experiences.

Journey to 10x

Evolution Phases

Initial Architecture (diagram labels)
- Data Collection Pipeline
- Streaming Topic 1, Streaming Topic 2, Streaming Topic 3
- Stream Processing Pipeline
- Identity Graph store
- Job Manager, Job Service, Task Manager
- Kubernetes Cluster

Challenges: Why we need to evolve?
- Operational Overhead
  - Single pipeline
  - Coupled processing + storage
  - Fragmented logic across batch & streaming
- Noisy Neighbor
  - Multi-tenant pipeline

We init
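
As an illustration of the "What is Identity Graph?" slide, the following is a minimal in-memory sketch of how linked identifiers (emails, device IDs, cookies) can collapse into a single view. It uses a union-find structure, which is an assumption chosen for illustration and is not taken from the deck; the class name `IdentityGraph`, its methods, and the sample identifiers are all hypothetical, and nothing here reflects how Adobe's Identity Service is actually implemented.

```python
from collections import defaultdict


class IdentityGraph:
    """Toy identity graph backed by union-find (hypothetical, illustrative only)."""

    def __init__(self):
        # Maps each identifier to its parent; a root is its own parent.
        self._parent = {}

    def _find(self, identifier):
        # Register unseen identifiers as the root of their own cluster.
        self._parent.setdefault(identifier, identifier)
        root = identifier
        while self._parent[root] != root:
            root = self._parent[root]
        # Path compression: point every node on the walked path at the root.
        while self._parent[identifier] != root:
            next_id = self._parent[identifier]
            self._parent[identifier] = root
            identifier = next_id
        return root

    def link(self, a, b):
        """Record that identifiers a and b were observed together."""
        root_a, root_b = self._find(a), self._find(b)
        if root_a != root_b:
            self._parent[root_b] = root_a

    def same_profile(self, a, b):
        """Return True if a and b resolve to the same unified profile."""
        return self._find(a) == self._find(b)

    def clusters(self):
        """Return every unified profile as a set of identifiers."""
        groups = defaultdict(set)
        for identifier in list(self._parent):
            groups[self._find(identifier)].add(identifier)
        return list(groups.values())


# Hypothetical sample events: each link is one observed pair of identifiers.
graph = IdentityGraph()
graph.link("email:ann@example.com", "cookie:abc123")
graph.link("cookie:abc123", "device:ios-42")
graph.link("email:bob@example.com", "device:android-7")
```

In this sketch the email and the iOS device end up in one profile even though they were never observed together directly, because both were linked to the same cookie. A production system at the scale quoted in the deck would need distributed storage and streaming ingestion rather than a single in-process dictionary.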
