
clickstream-data

Here are 11 public repositories matching this topic...

This project analyzes Wikipedia's clickstream data to uncover patterns in how users navigate from one article to another. It uses Apache Spark and PySpark for data manipulation and analysis, and aims to provide insight into user behavior on Wikipedia, including the most popular pathways to specific articles.

  • Updated Feb 15, 2024
  • Jupyter Notebook
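
As a rough illustration of what "most popular pathways to specific articles" means, here is a minimal sketch in plain Python. It is not the repository's code and does not use PySpark. It assumes the four tab-separated columns of the public Wikipedia clickstream dumps (`prev`, `curr`, `type`, `n`). The sample rows and the helper name `top_referrers` are hypothetical and made up for this example.

```python
import csv
import io
from collections import Counter

# Hypothetical sample rows in the assumed clickstream layout:
# prev <TAB> curr <TAB> type <TAB> n
SAMPLE_TSV = (
    "other-search\tApache_Spark\texternal\t500\n"
    "MapReduce\tApache_Spark\tlink\t120\n"
    "Apache_Hadoop\tApache_Spark\tlink\t300\n"
    "Apache_Spark\tMapReduce\tlink\t40\n"
)


def top_referrers(tsv_text, target, k=3):
    """Return up to k (prev, clicks) pairs with the most clicks into target."""
    counts = Counter()
    # QUOTE_NONE keeps article titles containing quote characters intact.
    reader = csv.reader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    for prev, curr, _link_type, n in reader:
        if curr == target:
            counts[prev] += int(n)
    # most_common sorts by click count, highest first.
    return counts.most_common(k)


if __name__ == "__main__":
    for prev, clicks in top_referrers(SAMPLE_TSV, "Apache_Spark"):
        print(f"{prev} -> Apache_Spark: {clicks}")
```

On a full monthly dump the same filter-and-aggregate step would more plausibly be expressed as a Spark DataFrame filter followed by a group-by and sum, which is presumably where PySpark comes in for the project described above.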
