ThinkNLP 🧠💬

Interactive NLP Learning Platform for Beginners

📚 Project Overview

ThinkNLP is an educational web application designed to help beginners in Natural Language Processing (NLP) understand the full pipeline of sentiment and topic analysis using real-world review data. It provides a step-by-step, no-code interface to interactively explore how NLP models work.

✨ Features

Full NLP pipeline walkthrough
Upload and process real review data
Compare sentiment models (VADER, TextBlob, BERT)
Topic modeling with LDA and interactive visualization (pyLDAvis)
Manual and auto topic labeling
Sentiment distribution per topic
Beginner-friendly UI with visual explanations

⚙️ NLP Pipeline

Upload Review File
- Supports CSV input, stored in AWS S3 (gzip compressed)
Data Cleaning
- Normalization: Lowercasing, typo correction
- Special character removal: special character, number, and emoji
- Tokenization: Breaking sentences into words
- Stopword Removal: remove common words that not provide useful context
- Lemmatization: Converting words to their base forms
EDA (Exploratory Data Analysis)
- Word clouds, frequency charts, sentence length plots
Topic Modeling
- LDA with auto or manual topic count
- pyLDAvis visualization
Topic Labeling
- Manual, keyword-based, or auto-inferred labels
Sentiment Analysis
- VADER: Rule-based model
- TextBlob: Lexicon-based
- BERT: Transformer-based classifier (optional)
Sentiment-Topic Mapping
- Each sentence assigned a dominant topic
- Sentiment computed per topic
- Output: Sentiment distribution per topic (Positive, Neutral, Negative)

🧱 Architecture

Frontend: React + TanStack Query (Vercel)
Backend: FastAPI + PostgreSQL (Docker, DigitalOcean)
Storage: AWS S3 for file uploads
Monitoring: BetterStack for logs and metrics

_{Diagram illustrating request flow, infrastructure, CI/CD, and observability for ThinkNLP.}

🛡️ Security & Optimization

Gzip file compression
Rate limiting & security headers
Future: Background processing with Celery

🚧 Future Roadmap

✅ User authentication & file history
✅ Background task support (Celery + status UI)
✅ Expanded model selection and interpretability features
✅ Beginner tutorials and automatic result summaries
✅ Support with Multiple Languages beside English Language

👩‍🎓 Target Audience

NLP beginners and students
Educators and instructors
Developers interested in NLP and no-code tools

🧪 Getting Started

Prerequisites

Docker
Python 3.11

Backend Setup

Create a Python virtual environment if you haven’t already:

python -m venv .venv
source .venv/bin/activate

Then run:

cd root_project
cp .env.example .env
make migrate        # Setup initial database schema
make up-local       # Run the backend server

Other helpful Makefile commands:

make reset-all      # Reset database and volumes
make logs-local     # View backend logs
make lint           # Run linter (ruff)
make test           # Run backend tests with coverage
...                 # You can check more on Makefile file

Frontend Setup (Separate Repo)

git clone https://github.com/sokritha-dev/think-nlp-frontend.git
cd root_project
yarn add
yarn dev

📁 Folder Structure

think-nlp/
├── .github/workflows/        # GitHub Actions workflows
├── .vscode/                  # VSCode editor settings
├── app/                      # FastAPI application code
├── k8s/                      # Kubernetes manifests
├── metric/                   # Monitoring & metrics utilities
├── migrations/               # Alembic migration files
├── reports/                  # Load test and analysis reports
├── scripts/                  # Helper and automation scripts
├── .autoenv.zsh              # Autoenv activation for Zsh
├── .dockerignore             # Docker ignore rules
├── .env.sample               # Example environment variables
├── .gitignore                # Git ignore rules
├── .python-version           # Python version pinning
├── Dockerfile                # Production Dockerfile
├── Dockerfile.dev            # Development Dockerfile
├── LICENSE                   # MIT License
├── Makefile                  # CLI automation for dev/test/deploy
├── alembic.ini               # Alembic configuration
├── docker-compose.*.yml      # Docker Compose files for different envs
├── locustfile.py             # Locust load testing script
├── pytest.ini                # Pytest config
├── requirements.txt          # Production dependencies
├── requirements-dev.txt      # Development dependencies

📝 License

This project is licensed under the MIT License.

❤️ Acknowledgements

Built using FastAPI, React, and pyLDAvis
NLP components inspired by open-source models and tutorials

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ThinkNLP 🧠💬

📚 Project Overview

✨ Features

⚙️ NLP Pipeline

🧱 Architecture

🛡️ Security & Optimization

🚧 Future Roadmap

👩‍🎓 Target Audience

🧪 Getting Started

Prerequisites

Backend Setup

Frontend Setup (Separate Repo)

📁 Folder Structure

📝 License

❤️ Acknowledgements

About

Uh oh!

Releases 15

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
.github/workflows		.github/workflows
.vscode		.vscode
app		app
docs		docs
grafana		grafana
k8s		k8s
migrations		migrations
reports		reports
scripts		scripts
.autoenv.zsh		.autoenv.zsh
.dockerignore		.dockerignore
.env.sample		.env.sample
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
Dockerfile.dev		Dockerfile.dev
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
alembic.ini		alembic.ini
docker-compose.development.yml		docker-compose.development.yml
locustfile.py		locustfile.py
otel-collector-config.yml		otel-collector-config.yml
prometheus.yml		prometheus.yml
pytest.ini		pytest.ini
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt

License

sokritha-dev/think-nlp

Folders and files

Latest commit

History

Repository files navigation

ThinkNLP 🧠💬

📚 Project Overview

✨ Features

⚙️ NLP Pipeline

🧱 Architecture

🛡️ Security & Optimization

🚧 Future Roadmap

👩‍🎓 Target Audience

🧪 Getting Started

Prerequisites

Backend Setup

Frontend Setup (Separate Repo)

📁 Folder Structure

📝 License

❤️ Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 15

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages