Howdy, my name is Josh. I created the Georgia State University - Data Science (GSU-DS) Organization to organize our Senior Projects.
Why this organization exists:
- Fairness & Attribution: This org allows all contributors to highlight these projects on their own portfolios without needing to fork them.
- Future Academic Utility: The name `GSU-DS` is intentionally designed so that a GSU professor may take over this organization in the future.
- The End Goal: To hand off this org to university faculty to serve as a hub to organize student projects and display student work centrally.
This organization serves as the central hub for the following Summer/Fall 2025 Senior Projects.
Please navigate to the individual repositories for specific documentation and codebases.
All contributors are listed alphabetically.
| Project Name (Repo) | Course | Semester | Description | Contributor(s) | Tech Stack |
|---|---|---|---|---|---|
| Campus Burglary Risk Prediction | Data Science Capstone | Fall 2025 | This project focuses on forecasting larceny risks across Atlanta’s 25 Neighborhood Planning Units (NPUs) to improve campus safety. | Gunn Madan, Harini Mohan, Joshua Piña, Yuntian 'Robin' Wu | PyTorch, LSTM, GeoPandas, Streamlit |
| Ticket-Heroes | Fundamentals of Data Science | Fall 2025 | An application that tracks ticket resale prices and predicts future fluctuations through feature-based regression and time series analysis. | Harini Mohan, Joshua Piña | scikit-learn, XGBoost, KNN, MongoDB |
| Mining Patterns from Large-Scale Flight Data | Data Mining | Fall 2025 | This project aims to extract knowledge from large-scale flight records to predict arrival delays. | Joshua Piña, Yuntian 'Robin' Wu | scikit-learn, Random Forest, Plotly, Streamlit |
| Living Library | Database Systems | Fall 2025 | A resource-stacked database charged with assisting the training of future DS students. | Joshua Piña | Hugging Face, FastAPI, Supabase (PostgreSQL) |
| Everything is a Graph | Design & Analysis: Algorithms | Fall 2025 | This project uses the Edmonds-Karp maximum-flow algorithm to create an image segmentation tool. | Joshua Piña | NumPy, OpenCV, Matplotlib |
| ML-HNSCC Study | Machine Learning | Summer 2025 | Exploratory ML to improve head and neck cancer treatment plans, with a focus on maximizing KBRT. | Abdul-Malik Mohamed, Joshua Piña, Somiya Rauf | PyTorch, XGBoost, MATLAB, Flask |
Note: Click the project names above to view their respective repositories.
This section guides new developers or team members on how to set up their environment to contribute to GSU-DS projects.
Most projects in this organization utilize a standard Data Science stack. Ensure you have the following installed:
- Git
- Anaconda or Miniconda (Recommended for environment management)
- VS Code or JupyterLab
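Before cloning anything, it can help to confirm that the command-line prerequisites are on your `PATH`. The following is a minimal sketch, not an official GSU-DS script: the helper name `check_tool` is hypothetical, and it only checks for the `git` and `conda` commands (editors such as VS Code or JupyterLab are left out because their command names vary by installation).

```shell
#!/usr/bin/env bash
# Hypothetical helper: report whether a command is available on PATH.
check_tool() {
  local tool="$1"
  if command -v "$tool" >/dev/null 2>&1; then
    echo "found:   $tool"
  else
    echo "missing: $tool"
    return 1
  fi
}

# Count how many of the command-line prerequisites are absent.
missing=0
for tool in git conda; do
  check_tool "$tool" || missing=$((missing + 1))
done
echo "${missing} prerequisite(s) missing"
```

If either tool is reported as missing, install it before moving on to the setup steps.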
- Clone the Repository: Navigate to the specific project you wish to work on and clone it: `git clone https://github.com/GSU-DS/[repository-name].git`, then `cd [repository-name]`.
- Create Virtual Environment: We recommend creating a fresh environment for each project to avoid dependency conflicts: `conda create -n gsu-ds-project python=3.9`, then `conda activate gsu-ds-project`.
- Install Dependencies: `pip install -r requirements.txt`
To ensure quality across all repositories, we follow this general workflow:
- Branching: Do not push directly to `main`. Create a feature branch: `git checkout -b feature/analysis-name`
- Commits: Write clear, descriptive commit messages.
- Pull Requests: Submit a PR for review before merging into the main branch.
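The workflow above can be rehearsed safely in a scratch repository before trying it on a real project. This is a sketch under stated assumptions: the temporary directory, the placeholder identity, and the file names `README.md` and `analysis.py` are all illustrative and do not come from any GSU-DS repository.

```shell
#!/usr/bin/env bash
set -eu

# Scratch repository so nothing here touches a real GSU-DS project.
demo_dir="$(mktemp -d)"
git init -q "$demo_dir"
cd "$demo_dir"
# Name the initial (still empty) branch "main" regardless of Git version.
git symbolic-ref HEAD refs/heads/main

# Placeholder identity, stored only in this scratch repo's config.
git config user.name "Example Student"
git config user.email "student@example.com"

# An initial commit on main to branch from.
echo "# demo" > README.md
git add README.md
git commit -q -m "Initial commit"

# 1. Branching: work on a feature branch instead of main.
git checkout -q -b feature/analysis-name

# 2. Commits: one focused change with a descriptive message.
echo "print('hello')" > analysis.py
git add analysis.py
git commit -q -m "Add placeholder analysis script"

# 3. Pull Requests: in a real clone you would now push the branch
#    and open a PR on GitHub, for example:
#    git push -u origin feature/analysis-name

current_branch="$(git rev-parse --abbrev-ref HEAD)"
commits_ahead="$(git rev-list --count main..HEAD)"
echo "On ${current_branch}, ${commits_ahead} commit(s) ahead of main"
```

Because the push step is commented out, the sketch runs entirely offline; in an actual project clone, `main` stays untouched until the pull request is reviewed and merged.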
- @joshuapina - Lead Maintainer / Program Manager
We welcome contributions from GSU students and faculty.
- Fork the project.
- Create your feature branch (`git checkout -b feature/AmazingFeature`).
- Commit your changes (`git commit -m 'Add some AmazingFeature'`).
- Push to the branch (`git push origin feature/AmazingFeature`).
- Open a Pull Request.
This project is licensed under the MIT License - see the LICENSE.md file for details.