Privacy-Preserving Social Network Clustering Using Differential Privacy

In the contemporary landscape of online social networks, preserving users' privacy while applying clustering techniques is a pivotal concern. This project integrates differential privacy into social network clustering to balance user privacy and clustering effectiveness. Through a detailed exploration of differential privacy parameters, this work provides insights into how privacy levels influence clustering accuracy and offers a comprehensive understanding of the relationship between privacy, data utility, and clustering in social networks.

Published

This research was published in the 2024 International Conference on Smart Systems for Electrical, Electronics, Communication, and Computer Engineering (ICSSEECC).

Key Features

Integration of differential privacy with social network clustering.
K-means clustering on a noisy feature matrix generated by Laplace noise.
Evaluation of epsilon parameter impacts on privacy and clustering performance.
Detailed graphical analysis and evaluation metrics.

Technologies / Libraries Used 🛠️

Python (3.8+)
numpy - Numerical computations
networkx - Graph and network analysis
matplotlib - Data visualization
sklearn - Machine learning tools
- KMeans - Clustering algorithm
- adjusted_rand_score - Evaluation metric
- silhouette_score - Evaluation metric
- davies_bouldin_score - Evaluation metric
Jupyter Notebook

Evaluation Metric Values 📈

Privacy Parameter (epsilon): Varied from 0.1 to 3 in increments of 0.1.
Optimal Epsilon Value: At an epsilon of 2.2, the clustering accuracy peaks at 80.53%, achieving a balance between privacy and effectiveness.
Detailed visualizations of the epsilon vs. metric relationship, highlighting the privacy-utility trade-offs.

Dataset 📊

Source: Twitter Social Network
Description: Contains user profiles, follower/friend lists, and interaction subgraphs.
Twitter Dataset

Installation

Clone the repository:

git clone https://github.com/KavinAravindhan/privacy-preserving-clustering.git

Install the necessary libraries:
```
pip3 install -r requirements.txt
```
Run Jupyter notebooks as per the analysis workflow.

Files

data_preprocessing.ipynb - Data preprocessing and cleaning.
k-means_clustering.ipynb - Initial clustering on raw data.
differential_privacy.ipynb - Adding differential privacy to data and clustering again.
accuracy_metrics.ipynb - Evaluation of clustering results using various metrics.
graphical_analysis.ipynb - Graphical analysis of metrics across different privacy parameter values.
privacy_preserving_clustering.ipynb - Comprehensive notebook with all steps combined.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Team Acknowledgment 🙌

A special thanks to our amazing team for their dedication and hard work. Despite the challenges, their commitment to learning new technologies and collaborating effectively made this project a success.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
paper		paper
.DS_Store		.DS_Store
.gitignore		.gitignore
1-data_preprocessing.ipynb		1-data_preprocessing.ipynb
2-k-means_clustering.ipynb		2-k-means_clustering.ipynb
3-differential_privacy.ipynb		3-differential_privacy.ipynb
4-accuracy_metrics.ipynb		4-accuracy_metrics.ipynb
5-graphical_analysis.ipynb		5-graphical_analysis.ipynb
LICENSE		LICENSE
README.md		README.md
privacy_preserving_clustering.ipynb		privacy_preserving_clustering.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Privacy-Preserving Social Network Clustering Using Differential Privacy

Published

Key Features

Technologies / Libraries Used 🛠️

Evaluation Metric Values 📈

Dataset 📊

Installation

Files

License

Team Acknowledgment 🙌

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Privacy-Preserving Social Network Clustering Using Differential Privacy

Published

Key Features

Technologies / Libraries Used 🛠️

Evaluation Metric Values 📈

Dataset 📊

Installation

Files

License

Team Acknowledgment 🙌

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages