CEO and Senior Instructor at Datapao, a Big Data and Cloud consultancy and training firm, focusing on industrial applications (aka Industry 4.0). Datapao helps Fortune 500 companies kick off and mature their data analytics infrastructure by giving them Apache Spark, Big Data and Data Analytics training and consultancy. Mate also serves as Senior Instructor in the Professional Services Team at Databricks, the company founded by the authors of Apache Spark. Previously he was Co-Founder and CTO of enbrite.ly, an award-winning Budapest based startup.
Mate has experience spanning more than a decade with Big Data architectures, data analytics pipelines, operation of infrastructures and growing organisations by focusing on culture. Mate also teaches Big Data analytics at Budapest University of Technology and Economics. Speaker and organiser of local and international conferences and meetups.
This 1-day course is for data engineers, analysts, architects, data scientist, software engineers, IT operations, and technical managers interested in a brief hands-on overview of Apache Spark.
The course provides an introduction to the Spark architecture, some of the core APIs for using Spark, SQL and other high-level data access tools, as well as Spark’s streaming capabilities and machine learning APIs. The class is a mixture of lecture and hands-on labs.
Each topic includes lecture content along with hands-on labs in the Databricks notebook environment. Students may keep the notebooks and continue to use them with the free Databricks Community Edition offering after the class ends; all examples are guaranteed to run in that environment.