Data Architect | Section 4 – How Spark Optimizes Your Query | Catalyst Optimizer | Apache Spark
Автор: Data Architect Diaries
Загружено: 2026-03-01
Просмотров: 27
Описание:
Catalyst Optimizer in Apache Spark – Deep Dive
In this video, we take a complete deep dive into the Catalyst Optimizer in Apache Spark — the intelligent query optimization engine behind Spark SQL, DataFrames, and Datasets.
If you’ve ever wondered:
How Spark converts logical plans into physical plans
How joins are optimized automatically
What is Rule-Based Optimization (RBO)
What is Cost-Based Optimization (CBO)
How Adaptive Query Execution (AQE) works
How Spark reduces shuffle and improves performance
This session will give you architect-level clarity.
What You’ll Learn
✔ Catalyst Optimizer Architecture
✔ Logical Plan vs Physical Plan
✔ Analyzer & Optimizer Phases
✔ Predicate Pushdown & Column Pruning
✔ Join Reordering & Broadcast Join
✔ Cost-Based Optimization (CBO)
✔ Adaptive Query Execution (AQE)
✔ Real-world PySpark examples
✔ Production-level performance insights
Catalyst Optimizer in Apache Spark
Spark SQL Query Optimization
Spark Logical Plan vs Physical Plan
Spark Analyzer and Optimizer
Spark Rule-Based Optimization
Spark Cost-Based Optimization
Adaptive Query Execution Spark
Spark Join Strategy Optimization
Broadcast Hash Join Spark
Spark Performance Tuning Techniques
Spark Query Execution Flow
Spark Execution Plan Explained
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: