Accelerating Spark Workloads in a Mesos Environment with Alluxio - Gene Pang, Alluxio, Inc.
Автор: The Linux Foundation
Загружено: 2017-10-31
Просмотров: 100
Описание:
Accelerating Spark Workloads in a Mesos Environment with Alluxio - Gene Pang, Alluxio, Inc.
Organizations Mesos and Apache Spark together to gain insight from large amounts of data. It is common for Spark to process data stored in disparate public cloud storage, such as Amazon S3, Microsoft Azure Blob Storage, or Google Cloud Storage as well as on-premise data on HDFS, Ceph or ECS. This architecture results in sub-optimal performance as data and compute are not co-located.
Using Alluxio, a memory speed virtual distributed storage system, deployed on Mesos enables connecting any compute framework, such as Apache Spark, to storage systems via a unified namespace. Alluxio enables applications to interact with any data at memory speed. Alluxio can eliminate the pains of ETL and data duplication, and enable new workloads across all data. Gene will discuss the architecture of Mesos, Spark and Alluxio to achieve an optimal architecture for enterprises.
About Gene Pang
Gene Pang is one of PMCs and maintainers of the Alluxio open source project and a founding member at Alluxio, Inc. He recently graduated with a Ph.D. from the AMPLab at UC Berkeley, working on distributed database systems. Before starting at Berkeley, he worked at Google and has an M.S. from Stanford University, and B.S. from Cornell University.
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: