Northwestern University PREDICT 411: Predictive Modeling II

andrewgdunn/predict411

Artifacts from completing Northwestern University Predict 411

This course extends ordinary least squares (OLS) linear regression by introducing
generalized linear model (GLM) regression. The course reviews traditional linear
regression and then continues with logistic regression, Poisson regression, and
survival analysis. The course is heavily weighted towards practical application
with large data sets containing missing values and outliers. It addresses issues
of data preparation, model development, model validation, and model deployment.

I'm sharing artifacts from my completion of the course in the hope that someone may benefit from the approach taken. I'm sharing everything that I think is reasonable without infringing on the publishers of the reference texts or the academic institution. If you've come upon this repository and use it for reference, please contact me and let me know what helped you. Alternatively, if you've found this and believe I'm in violation for sharing the material, please contact me first rather than issuing a take-down.

I've deliberately included the data along with the source code so that the work can be reproduced. However, I've omitted the course instruction, as sharing it is likely to prompt a claim from the institution. The course is also likely to change significantly as time goes on, so the value of this reference is limited. Still, I personally found references from decades prior useful while working on this course, and felt that if I could benefit anyone I should share.

The overall caveat is that I'm just a student, typically out of time and under duress to complete the course material. I don't claim that this is a correct reference; rather, I know there are significant shortcomings. Read everything as a skeptic, and think critically rather than directly re-using something that I have done.

Northwestern's [Predictive Analytics](http://sps.northwestern.edu/program-areas/graduate/predictive-analytics/) program was the first I found that offers a fully accredited degree through a completely online program, with a curriculum oriented toward becoming a practitioner of Data Science. I'm excited to participate in the program and hope that the university doesn't take offense at my sharing this coursework.

I would like to see higher education adopt a more open form of education, much like what we're seeing in the massive open online courses (MOOCs) of late. I would further like to see institutions choose libre/open technology for their curriculum, as tools are forms of expression, and proprietary ones become unavailable to students after they leave the institution. Where possible I've tried to use or reproduce work in libre/open form; however, time is always limited.

andrew.g.dunn@u.northwestern.edu
