Skip to content

Different clustering and clustering metrics are implemented in this repository

License

Notifications You must be signed in to change notification settings

duttaprasanta/clustering

Repository files navigation

Clustering

In this repository, the following classes and functions are implemented

The classes and functions are distributed into the following modules -

  • cluster : It contains KMeans and Agglomerative classes
  • metrics : It contains Silhouette score and Jaccard Index computation functions
  • utils : It contains a function named export which is used to export the results of clustering into a file

Usage

To know how to use them, go through driver.ipynb

Running the codes

Step 1

Clone this repository with git clone https://github.com/duttaprasanta/clustering.git. You may also download this repository as zip file and extract it.

Step 2

Run the python files by installing required packages. For running the ipynb files you need to install jupter notebook or jupyter lab or visual studio code (an extension is needed). You may also run it by uploading into Google Colab.

If you stuck, with dependency conflicts, you can create a virtual environment and activate it by doing the following -

pip install virtualenv
virtualenv env
source env/bin/activate
pip install -r requirements.txt

Deactivate all the activated environment before doing the previous step. In case of conda, deactivate it using conda deactivate. In case of pip, do deactivate.

Useful links