ua-datalab/DataEngineering

UArizona Data Lab Workshops - Spring 2024

Data Engineering

Navigating the World of Engineering

How can you master handling massive datasets and transform raw data into insightful, actionable information?

Join our workshops, designed for graduate students, to dive deep into advanced data management and analysis techniques. Learn how to manage databases efficiently, work through ETL (Extract, Transform, Load) processes, and get hands-on experience with current big data technologies.

Are you ready to elevate your data engineering skills and stand out in the rapidly evolving field of data science?

Are you curious about how to kickstart your journey in data engineering with user-friendly tools before diving deep into the core complexities of the field?

We begin the workshop series with an accessible introduction to Streamlit and Gradio, building interactive web applications to visualize and manipulate data. From there, the weekly sessions move into the core of data engineering: SQL and NoSQL databases, ETL processes, and big data tools such as Hadoop, Hive, and Spark. This gradual progression gives you a solid foundation before you tackle more advanced techniques. Are you ready to go from building data-driven applications to extracting, transforming, and loading data yourself?
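For readers new to the term, the three ETL stages can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not material taken from the workshop notebooks. The sample data, the table name `temps`, and the function names are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical raw data standing in for a real source file or API response.
RAW_CSV = """city,temp_f
Tucson,104
Phoenix,110
Flagstaff,
"""


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows with a missing reading, convert Fahrenheit to Celsius."""
    cleaned = []
    for row in rows:
        if not row["temp_f"]:
            continue  # skip incomplete records
        temp_c = round((float(row["temp_f"]) - 32) * 5 / 9, 1)
        cleaned.append((row["city"], temp_c))
    return cleaned


def load(records, conn):
    """Load: write the cleaned records into a SQLite table."""
    conn.execute("CREATE TABLE IF NOT EXISTS temps (city TEXT, temp_c REAL)")
    conn.executemany("INSERT INTO temps VALUES (?, ?)", records)
    conn.commit()


def run_etl(conn):
    """Run all three stages and return the loaded rows, sorted by city."""
    load(transform(extract(RAW_CSV)), conn)
    return conn.execute("SELECT city, temp_c FROM temps ORDER BY city").fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")  # in-memory database, nothing is written to disk
    for city, temp_c in run_etl(connection):
        print(city, temp_c)
    connection.close()
```

Keeping each stage in its own function makes it straightforward to swap the source or the target, for example replacing the in-memory SQLite database with one of the databases covered later in the series.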


RESOURCES AND NOTES:

| Date | Topic | Resources |
| --- | --- | --- |
| 01/29/24 | Building Python web apps with Streamlit and Gradio | Streamlit notebook (Open in Colab); Gradio notebook (Open in Colab); Presentation slides; YouTube video |
| 02/05/24 | Deploying ML models with Streamlit and Gradio | Streamlit notebook (Open in Colab); Gradio notebook (Open in Colab); Presentation slides; YouTube video |
| 02/12/24 | Introduction to SQL, Part 1 | SQL and DuckDB notebook (Open in Colab); Presentation slides; YouTube video |
| 02/19/24 | Introduction to SQL, Part 2 | SQL and DuckDB notebook (Open in Colab); Presentation slides; YouTube video |
| 02/26/24 | Introduction to NoSQL, Part 1 | MongoDB-PyMongo notebook (Open in Colab); Presentation slides; YouTube video |
| 03/04/24 | Spring Break | - |
| 03/11/24 | Introduction to NoSQL, Part 2 | Cassandra notebook (copy the notebook link and open it using Jupyter's "Open from URL" function on CyVerse); Presentation slides; YouTube video |
| 03/18/24 | Introduction to Hadoop and Hive | Hadoop & Hive notebook (Open in Colab); Presentation slides; YouTube video |
| 03/25/24 | Introduction to Spark and PySpark | Spark-PySpark notebook (Open in Colab); Presentation slides; YouTube video |

CC BY-NC-SA 4.0