The DBT of ML: Aligned describes data dependencies in ML systems and reduces technical data debt
Updated Jun 11, 2024 - Python
This project uses Spotify's API and user-provided playlists to build and tune a neural network model that generates song recommendations based on the song data in those playlists.
End-to-end machine learning project I made as a machine learning intern at Mentorness
A platform enables sharing diverse knowledge, but similarly worded questions are common. We use NLP techniques to identify duplicate questions, enhancing user experience by making it easier to find high-quality answers.
All Statistics concepts
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
Up to 90% accuracy with just 5 features, using the KNN algorithm and PCA for feature engineering. The dataset contained fewer than 1000 observations. The model's accuracy could be improved with more observations, further hyperparameter optimization, and additional feature engineering.
Comprehensive notes and code on Python, data analysis, visualization, machine learning, and deep learning from my data science learning journey.
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage and metadata. Runs and scales everywhere Python does.
This project uses Exploratory Data Analysis (EDA) to uncover trends and insights from restaurant cuisine ratings, helping improve menus, enhance customer experiences, and guide targeted marketing strategies for business success.
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also supports running data cleaning scenarios with these algorithms. Desbordante has a console version and an easy-to-use web application.
Data Science Projects done at Data Trained Education during PG in Data Science and Machine Learning.
A low-code machine learning personalized ranking service for articles, listings, search results, and recommendations that boosts user engagement. A friendly learn-to-rank engine.
EvalML is an AutoML library written in Python.
Smart Meter Analytics Python - A Python implementation for analysis of energy consumption data (electricity, gas, water) at different data measurement intervals. The package provides feature extraction methods and algorithms to prepare data for data mining and machine learning applications