feature-selection

Here are 1,520 public repositories matching this topic...

selvatica-36 / predicting-fuel-economy

This project aims to understand and predict a car's fuel efficiency based on its characteristics. I have built a multiple linear regression model using stats models and scikit-learn.

scikit-learn regression feature-selection supervised-learning feature-engineering predictive-modeling statsmodels

Updated Jun 12, 2024
Jupyter Notebook

harveybc / preprocessor

Star

Data pre-processing with modular components for: normalizer/standarizer, unbiaser, trimmer and feature selector.

machine-learning reinforcement-learning timeseries deep-learning preprocessor openai-gym regression feature-selection dataset feature-extraction classification standardization preprocessing spectral-analysis trims

Updated Jun 12, 2024
Python

sanhiitaa / salary-prediction

Star

End-to-End Machine Learning project I made as a machine learning intern @ Mentorness

data-science machine-learning deployment pipeline linear-regression machine-learning-algorithms feature-selection data-preprocessing feature-engineering lasso-regression model-comparison random-forest-regression gradient-boosting-regressor xgboost-regression streamlit pickle-file

Updated Jun 11, 2024
Jupyter Notebook

mlr-org / mlr

Sponsor

Star

Machine Learning in R

Updated Jun 11, 2024
R

SerkanGuldal / sentetik

Star

Synthetic data generation package to balance imblanaced datasets

machine-learning feature-selection feature-extraction imbalanced-data

Updated Jun 11, 2024
Python

mlr-org / mlr3fselect

Sponsor

Star

Feature selection package of the mlr3 ecosystem.

machine-learning r optimization feature-selection evolutionary-algorithms r-package random-search recursive-feature-elimination exhaustive-search mlr3 sequential-feature-selection

Updated Jun 11, 2024
R

HannaMeyer / CAST

Star

Developer Version of the R package CAST: Caret Applications for Spatio-Temporal models

machine-learning spatial variable-selection feature-selection caret autocorrelation predictive-modeling spatio-temporal overfitting

Updated Jun 11, 2024
R

sachin14596 / Customer-Churn-Prediction-Feature-Selection-using-Genetic-Algorithm

Star

This project explores an IBM telecom dataset, conducting initial EDA and data preprocessing. It examines three genetic algorithm variations for feature selection: one-point, two-point, and uniform crossover. Logistic regression is used to predict customer churn, and performance is evaluated using error bar plots.

genetic-algorithm feature-selection evolutionary-algorithms logistic-regression churn-prediction classification-algorithm crossovers

Updated Jun 10, 2024
Jupyter Notebook

achuman1 / EDA-Restaurant-Cuisine-Ratings

Star

This project uses Exploratory Data Analysis (EDA) to uncover trends and insights from restaurant cuisine ratings, helping improve menus, enhance customer experiences, and guide targeted marketing strategies for business success.

visualization exploratory-data-analysis feature-selection feature-engineering

Updated Jun 10, 2024
Jupyter Notebook

Desbordante / desbordante-core

Star

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

data-science data-mining exploratory-data-analysis tabular-data feature-selection data-engineering feature-extraction data-analytics knowledge-discovery data-wrangling data-preprocessing feature-engineering spreadsheets data-exploration data-mining-algorithms data-cleaning data-profiling anomaly-detection data-cleansing correlations

Updated Jun 11, 2024
C++

alperbulbul1 / Feature-Selection-with-Firefly-Algorithm

Star

Feature selection with Firefly Algorithm

python machine-learning sklearn feature-selection firefly-algorithm metaheuristics

Updated Jun 10, 2024
Python

ai-on-browser / ai-on-browser.github.io

Star

Machine learning and data analysis package implemented in JavaScript and its online demo.

Updated Jun 11, 2024
JavaScript

alteryx / evalml

Star

EvalML is an AutoML library written in python.

data-science machine-learning optimization feature-selection model-selection feature-engineering hyperparameter-tuning automl

Updated Jun 11, 2024
Python

Nishant2018 / PCA-Feature-Selection-Scratch

Star

Principal Component Analysis (PCA) is a powerful dimensionality reduction technique commonly used in machine learning and data analysis. It transforms a dataset into a set of linearly uncorrelated variables called principal components.

machine-learning statistics linear-algebra feature-selection pca

Updated Jun 10, 2024
Jupyter Notebook

akanz1 / klib

Sponsor

Star

Easy to use Python library of customized functions for cleaning and analyzing data.

python data-science data-visualization feature-selection data-analysis klib data-preprocessing data-cleaning

Updated Jun 10, 2024
Python

DavideDevetak24 / Steel_Market_Study_DL

Star

The repository presents the notebooks and models used for my experimental thesis entitled: "Experimental Study of the Steel Market Through CNN-LSTM Deep Learning Models: Practical Applications for Cost Reduction in Industries"

machine-learning deep-neural-networks deep-learning feature-selection neural-networks feature-engineering keras-tensorflow stl-algorithms cnn-lstm loess cnn-lstm-models loess-smoothers seasonality-analysis

Updated Jun 10, 2024
Jupyter Notebook

harmanveer-2546 / Wafer-Fault-Detection

Star

The goal is to eliminate manual work in identifying faulty wafers. Opening and handling suspected wafers disrupts the entire process. False negatives result in wasted time, manpower, and costs.