The Technical Architecture of a Data Lake Powered by Apache Hadoop | Big Data Analytics Presentation
Автор: Usha Karuturi
Загружено: 2025-12-01
Просмотров: 3
Описание:
This presentation explains how Apache Hadoop enables enterprise Data Lakes through scalable storage, distributed processing, and a powerful data ecosystem. It covers the key Hadoop components including HDFS and YARN, along with supporting tools such as Hive, Spark, Sqoop, and Flume. The video highlights the complete data life cycle in a Hadoop-based Data Lake — from ingestion and metadata management to data processing and analytics delivery.
🎯 Topics Covered:
What is a Data Lake?
Schema-on-Read vs Schema-on-Write
Hadoop core architecture: HDFS & YARN
Distributed storage and fault tolerance
Multi-tenant resource management
Hadoop ecosystem tools (Hive, Spark, Sqoop, Flume)
End-to-end Data Lake workflow
👤 Presenter:
Usha Sri Karuturi
CPSC 6730 – Big Data Analytics
Instructor: Agha Saadat
Governor's state university
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: