Skip to content

In this notebook, we're going to go through an example machine learning project with the goal of predicting the sale price of bulldozers.

Notifications You must be signed in to change notification settings

mebestaca/bulldozer-price-predict-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 

Repository files navigation

Predicting the Sale Price of Bulldozer using Machine Learning

In this notebook, we're going to go through an example machine learning project with the goal of predicting the sale price of bulldozers.

1. Problem Definition

How well can we predict the future sale price of a bulldozer, given its characteristics and previous examples of how much similar bulldozers have been sold for?

2. Data

The data is downloaded from the Kaggle Blue Book for Bulldozers competition: https://www.kaggle.com/c/bluebook-for-bulldozers/data

There are 3 main datasets:

  • Train.csv is the training set, which contains data through the end of 2011.

Valid.csv is the validation set, which contains data from January 1, 2012 - April 30, 2012 You make predictions on this set throughout the majority of the competition. Your score on this set is used to create the public leaderboard.* Test.csv is the test set, which won't be released until the last week of the competition. It contains data from May 1, 2012 - November 2012. Your score on the test set determines your final rank for the competitio

3. Evaluation

The evaluation metric for this competition is the RMSLE (root mean squared log error) between the actual and predicted auction prices.

For more on evaluation on the evaluation of this project check: https://www.kaggle.com/c/bluebook-for-bulldozers/overview

Note The goal for more most regression evaluation metrics is to minimize the error. For example, our goal for this project will be to build a machine learning model which minimizes RMSLE.

4. Features

Kaggle provides a data dictionary detailing all of the features of the dataset. You can view this data dictionary on Google Sheets: https://docs.google.com/spreadsheets/d/16XdrpMWuITqXuyDHgUt3pBnChWM4M0mo/edit?usp=sharing&ouid=109881050709779521166&rtpof=true&sd=truen.

About

In this notebook, we're going to go through an example machine learning project with the goal of predicting the sale price of bulldozers.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published