Comparison of model selection methods for Boston dataset (in MASS package)

The performance of four selections methods, Best subsets, Ridge regression, Lasso regression, and Manual selection, have been compared using the ‘Boston’ dataset included in the MASS package. The model objective is to determine the relationship between per capita crime rate, ‘crim’, on other 13 predictors where 2 are categorical variables with 2 levels and 9 levels each. First, an exploratory data analysis is performed to analyze the distribution of variables and to investigate preliminary relationship between ‘crim’ and predictors. Due to high skewedness of variables leading to violations of linear regression assumptions, transformed variables are used throughout the model selection process. The Best subsets elminated 6 predictors and had the highest prediction accuracy followed by Manual selection method. The increase in model bias through Ridge or Lasso regression did not result in significant improvement in prediction accuracy for the transformed variables.

File Description

Project1.pdf : Project report file (PDF)
Project.R : R code script

You can view the Project Report in HTML by clicking here.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.gitattributes		.gitattributes
LICENSE		LICENSE
Project1.R		Project1.R
Project1.html		Project1.html
Project1.pdf		Project1.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitattributes

.gitattributes

LICENSE

LICENSE

Project1.R

Project1.R

Project1.html

Project1.html

Project1.pdf

Project1.pdf

README.md

README.md

Repository files navigation

Comparison of model selection methods for Boston dataset (in MASS package)

File Description

About

Releases

Packages

Languages

License

gapkim/Boston_Dataset

Folders and files

Latest commit

History

Repository files navigation

Comparison of model selection methods for Boston dataset (in MASS package)

File Description

About

Topics

Resources

License

Stars

Watchers

Forks

Languages