You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repository you are browsing contains intermediate level piece of codes which are useful for cleaning, exploratory analysis, handling of missing data points, outlier detection and different visualization techniques using graphics, ggplot2, tidycharts, ggExtra packages. Also in particular part of the script you can get basic information about…
This is a demonstration of using Spark to explore large dataset, by using PySpark and SparkR. The files include loading data, data exploration and using clustering on words of Shakespeare's novels.
Bi and Big Data Analytics, sparkR, Supervised and Unsupervised Machine Learning techniques The project's aim is of applying a supervised and an unsupervised machine learning technique on a dataset to test different models/scenario, interpret the results, perform predictions for each model and visualised the results.