A Lakehouse Big Data Ecosystem Based on Flink, Kylin, and Hudi Amid the Convergence Trend

Yang Hua (杨华), Wang Xianghu (王祥虎)

#1 Lakehouse architecture
#2 Flink/Hudi/Kylin: introduction and integration
#3 Building a lakehouse in practice at T3出行 (T3 Go)

What is a data lake?

AWS definition: A data lake is a centralized repository that allows you to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the data, and run different types of analytics, from dashboards and visualizations to big data processing, real-time analytics, and machine learning, to guide better decisions.

What is a data warehouse?

AWS definition: A data warehouse is a central repository of information that can be analyzed to make more informed decisions. Data flows into a data warehouse from transactional systems, relational databases, and other sources, typically on a regular cadence. Business analysts, data engineers, data scientists, and decision makers access the data through business intelligence (BI) tools, SQL clients, and oth