Using the ETL process to Extract, Transform, and Load local data into a PostgreSQL database
Updated Aug 2, 2019 - Jupyter Notebook
Project Data Modeling with Cassandra as part of Udacity's Data Engineering Nanodegree
ETL data pipeline (API + Apache Airflow + PostgreSQL)
Build an Airflow pipeline from Amazon S3 to Amazon Redshift
YouTube Trending Videos Analysis
Example project implementing best practices and testing for PySpark data pipelines.
Project for Udacity Data Engineering Nanodegree. Designing a relational database schema and building an ETL pipeline.
DSND Disaster Response Pipelines Project
BI analysis course - IGTI
Modeled the data with Apache Cassandra and built an ETL pipeline in Python. The pipeline creates Apache Cassandra tables for the target queries, consolidates a directory of CSV files into a single streamlined CSV file, and models and inserts that data into the Cassandra tables.
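The CSV-consolidation step described above could be sketched as follows (the function name and column list are assumptions, not taken from the project):

```python
import csv
import glob
import os

def build_streamlined_csv(input_dir, output_path, columns):
    """Consolidate every CSV file in input_dir into one streamlined CSV
    containing only the given columns, ready for insertion into
    Cassandra tables."""
    with open(output_path, "w", newline="") as out:
        writer = csv.DictWriter(out, fieldnames=columns, extrasaction="ignore")
        writer.writeheader()
        for path in sorted(glob.glob(os.path.join(input_dir, "*.csv"))):
            with open(path, newline="") as f:
                for row in csv.DictReader(f):
                    writer.writerow(row)
```

The resulting file can then be re-read once per target table, since Cassandra data modeling typically denormalizes the same rows into one table per query.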
Applying data warehouse tools and AWS to build an ETL pipeline for a database hosted on Redshift: loading data from an AWS S3 bucket into staging tables on Redshift, then executing SQL statements that create the analytics tables from those staging tables.
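The S3-to-staging load in this kind of pipeline is usually done with Redshift's COPY command. A minimal sketch of building such a statement (the table, bucket path, and IAM role here are placeholders):

```python
def build_copy_statement(table, s3_path, iam_role, region="us-west-2"):
    """Build a Redshift COPY statement that loads JSON data from S3
    into a staging table. All argument values are placeholders."""
    return (
        f"COPY {table} FROM '{s3_path}' "
        f"IAM_ROLE '{iam_role}' "
        f"REGION '{region}' FORMAT AS JSON 'auto';"
    )
```

The generated SQL would then be executed against Redshift with a driver such as psycopg2, once per staging table.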
An introductory ETL tutorial using Python's Luigi library.
Created an automated pipeline that takes in new movie data, performs the appropriate transformations, and loads the data into existing tables, completing the ETL process by adding the data to a PostgreSQL database.
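The transformation step of such a pipeline might look like the sketch below; the field names and cleaning rules are hypothetical, chosen only to illustrate the shape of the code:

```python
def clean_movie(raw):
    """Transform one raw movie record: keep a known column set,
    drop empty fields, and normalize the runtime to an int.
    Field names here are hypothetical."""
    keep = ("title", "release_date", "runtime", "budget")
    movie = {k: raw.get(k) for k in keep if raw.get(k) not in (None, "")}
    if "runtime" in movie:
        movie["runtime"] = int(float(movie["runtime"]))
    return movie
```

The cleaned records could then be loaded into the existing PostgreSQL tables, for example with pandas' `DataFrame.to_sql` over a SQLAlchemy connection.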
This project classifies and analyzes public sentiment on COVID vaccination across time, languages, and countries. It is divided into three parts: data collection and transformation; exploratory analysis and data visualization with a dashboard; and topic modeling and sentiment analysis.
Udacity Data Engineering Nanodegree - Project 3: Data Warehouse
An implementation of the data integration process Extract, Transform, Load (ETL)
In this script, I created an ETL pipeline that extracts datasets from an AWS S3 source bucket on a schedule, builds a report by applying transformations, and loads the transformed data into another AWS S3 target bucket.
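In a pipeline like this, the extract and load steps would typically use boto3's `get_object` and `put_object`, with the schedule driven by a cron job or an AWS scheduler. The report-building transformation itself can be sketched in plain Python (column names are assumptions):

```python
import csv
import io

def build_report(csv_bytes, group_col, value_col):
    """Aggregate an extracted CSV dataset into a summary report:
    the total of value_col per group_col, returned as CSV text
    ready to upload to the target bucket."""
    totals = {}
    for row in csv.DictReader(io.StringIO(csv_bytes.decode("utf-8"))):
        totals[row[group_col]] = totals.get(row[group_col], 0.0) + float(row[value_col])
    out = io.StringIO()
    writer = csv.writer(out)
    writer.writerow([group_col, f"total_{value_col}"])
    for key in sorted(totals):
        writer.writerow([key, totals[key]])
    return out.getvalue()
```

Keeping the transform as a pure function of bytes-in, text-out makes it easy to unit-test without touching S3.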
Database Schema & ETL pipeline for Song Play Analysis | Bosch AI Talent Accelerator Scholarship Program