《ClickHouse 和 Databricks 用于实时分析.pdf》由会员分享,可在线阅读,更多相关《ClickHouse 和 Databricks 用于实时分析.pdf(20页珍藏版)》请在三个皮匠报告上搜索。
1、ClickHouse and Databricks for Real-Time AnalyticsMelvyn Peignon-Principal Product Manager|ClickHouseJune 11,2025What is ClickHouse?3Your extremely fast databaset FasterLegacy on-prem data platformCloud fata warehouseTransactional and general purpose databaseAd-hoc and periodic workflowsEg.basic data
2、 discovery,generating reportsHighly concurrent and interactiveEg.immersive data drive apps,monitoring/alerting/detectionsReal-time analytics databaseElasticsearchDruidUse Cases 4 Data warehousingData warehousingInteractively slice and dice your data for analysis,reporting,and building internal appli
3、cations.Evaluate user behaviors,ad and media perf,market dynamics,and more.ML and Gen AIML and Gen AIExecute fast and efficient vector search.Plug-and-play Generative AI models from any provider.Use lightning-fast aggregations to power model training at petabyte scale.ClickHouse Adoption5A fast grow
4、ing open-source project40K+Stars7K+Forks1.6k+Contributors490+Active contributorsCommits over timeGithub statisticsDesigned for real Time6ComputezMV 1 Table 2MV 2zzTable 5 MV 5zprojectionprojection Application4 Servert ServerClickHouse Native Formatt High insert throughputt Completely self-containedt
5、 Distributed by designOn top of any Source7Server Application4 ServerServerG 0 Data Lake0m Table 1 MV 1Table 2 Federated queries with many different systemsb Wide array of connectors+Multi-cloudsupportAvailable in the cloud or self-managedReal-time Analytics with DatabricksYour not so typical databa
6、seYour favorite data platform8Question:How to leverage the speed of ClickHouse on top of your data platform?ClickHouse and Open Table Formats9Hot+Writers Writers Table 1MV 1z Application+Data lakeHistory of Delta Lake Support10Starting in the communityBut not onlyHistory of Delta