Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make correlation matrix to assess the relationship between the different variables in the dataset. #40

Open
4 tasks
jacobqs opened this issue Jan 27, 2023 · 0 comments
Milestone

Comments

@jacobqs
Copy link
Owner

jacobqs commented Jan 27, 2023

  • Begin by calculating the Pearson’s correlation coefficient for each pair of variables. This can be done using a statistical software package or by hand.

  • Create a matrix of correlation coefficients by pairing each variable with all other variables in the dataset.

  • Visualize the data by graphing the correlation coefficients in a heatmap. This will help you to identify any strong correlations between the variables.

  • Interpret the correlations and identify any possible relationships between the variables. You can also use this matrix to identify any multicollinearity issues, which can be a problem when building predictive models.

@jacobqs jacobqs added this to the Data analysis milestone Jan 27, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant