Crunch, Data Conference, October 16-18, 2019 Budapest
Wojciech Biela

Wojciech Biela

Co-founder at Starburst

Bio:

Wojciech Biela is a co-founder of Starburst and responsible for product development. He has a background of over 15 years of building products and running engineering teams. Previously Wojciech was the Engineering Manager at the Teradata Center for Hadoop, running the Presto engineering operations in Warsaw, Poland. Prior to that, back in 2011, he built and ran the Polish engineering team, a subsidiary of Hadapt Inc., a pioneer in the SQL-on-Hadoop space. Hadapt was acquired by Teradata in 2014. Earlier, Wojciech built and lead teams on multi-year projects, from custom big e-commerce & SCM platforms to POS systems. Wojciech holds a M.S. in Computer Science from the Wrocław University of Technology.

Workshop:

Presto: SQL-on-Anything, hands-on workshop

Topics:
data engineering
BI
SQL
ETL
data warehouse
analytics
Level:
Beginner

Description

Presto has become the ubiquitous open source software for SQL on anything. Presto is heavily used by Facebook, Netflix, Airbnb, LinkedIn, Twitter, Uber, and many others for low-latency querying large amounts of data, wherever it resides (Hadoop, AWS S3, Cassandra, Postgres, etc). Presto was engineered from the ground up for fast interactive SQL analytics against disparate data sources ranging in size from GBs to PBs.

Join Wojciech Biela for this full-day workshop to learn about Presto’s concepts, architecture and explore its many use cases and best practices you can implement today. Learn how to setup and use Presto through various hands-on exercises (those who don’t want to participate in the exercises can follow along).

Target audience

Roles: data engineers, data architects, software engineers, and those in IT

Prerequisite knowledge

A basic understanding of SQL, databases, Hadoop, and distributed systems.
Basic command line (Bash) skills.

Materials or downloads needed in advance

A laptop with a browser.

Agenda

Rough outline of the training, including slides and labs (hands-on exercises):

    • Presto architecture and technical concepts
    • Lab 1 - Manual Presto deployment
    • Presto query execution
    • Presto Ecosystem, Connectors and Connectivity
    • Migrating from Hive
    • Administering Presto
    • Presto in cloud environments
    • Lab 2 - Query S3 Data using Presto
    • Lab 3 - Query PostgreSQL using Presto
    • Lab 4 - Query Federation using Presto
    • Instructor lab demonstrations:
      • Lab 5 - Using Presto w/ AWS Glue Data Catalog
      • Lab 6 - Scaling Presto on AWS
    • Lab 7 - Presto and BI tools (connecting from Superset)
    • Query Performance, Cost-Based Optimizer
    • Lab 8 - Cost-Based Optimizer in Action
    • Security in Presto
    • Joining the Presto community