The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
-
Updated
Jun 12, 2024 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Feldera Continuous Analytics Platform
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Privacy and Security focused Segment-alternative, in Golang and React
Aqueduct Core is responsible for the core functionality of Aqueduct, an experiment management system.
Solução completa dedicada a realizar ETL de dados de cotações de moedas usando Python. Fonte dos dados: https://docs.awesomeapi.com.br/api-de-moedas
Jayvee is a domain-specific language and runtime for automated processing of data pipelines
Flink CDC is a streaming data integration tool
Cryptocurrency prediction using LSTM (Long Short Term Memory)
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
A repository for the Methods of Advanced Data Engineering course at FAU
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
Resilient data pipeline framework running on Apache Spark
使用ETL data pipeline 將UBER 資料清洗、排程、最後放置在GCP上運行與後續分析 的專案
Developed a deep learning model utilizing TensorFlow to automate the classification of financial documents. Leveraging a Bidirectional LSTM RNN, we accurately categorize the documents. Our user-friendly Streamlit application ensures high accuracy & efficiency in document management, all deployed on the Hugging Face platform for seamless integration
Apply Data Engineering to Personal Finance
Backend service for Scribe app data downloads
Add a description, image, and links to the data-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the data-pipeline topic, visit your repo's landing page and select "manage topics."