-Introduction to Distributed Computing - What is In-Memory Computing? - Spark v/s MapReduce - Spark Architecture Overview - Explore the Data - Create RDD, DataFrame, and Datasets - Transformations and Actions using Spark - Draw Insights on a given dataset.
=========== Pre-Requisites ============ - Laptop with 4 GB RAM - Basic Programming Knowledge (Java or Scripting) - Min 2+ years Experience in IT
========= Outcome: ======== Gain insight into Distributed Computing Understand the Spark Architecture and appreciate the power of in-memory computation Based on a given dataset, quickly analyze and come up with useful insights using Spark. Gain a clear understanding of where Spark fits in the industry.
========================= Who Should Attend (min 2+ years) ========================== - ETL Developers and Testers who are stuck in legacy Data Warehouse - Manual Testers who see no future in their current role. - People with 20+ experience who have plateaued in their career.