The Erdős Institute Deep Learning Bootcamp Summer 2025
Team Members:
Facial expression recognition is a crucial component of human-AI interaction and a core problem in computer vision, with applications in areas such as image captioning and behavioral analysis. The goal of this project is to develop a model capable of detecting faces and classifying emotional expressions accurately, particularly in group image settings. To achieve this, the project employs two complementary deep learning approaches: a fine-tuned YOLO model that performs face and emotion detection in a single step, and a two-stage framework that combines an automatic face detector with a fine-tuned emotion classifier for refined prediction.
- 221 images of people in natural, unconstrained environments were taken from the “Human Group Emotions Labelled” (1) and “Emotic” (2) datasets combined. We added custom emotion labels and face positions to these 221 images and set aside 43 of them for final testing of both models. (Train set: 178 photos, ~1,100 labelled people with emotion labels and bounding-box information; final test set: 43 photos, ~260 labelled people with emotion labels and bounding-box information.)
- For the two-stage model, we additionally added single-face photos from the “FACES database” (3) and a subset of the “RAF database” (4). (Single-face photos: ~72 images from the FACES database, ~1,900 images from the RAF database.)
- Human Group Emotions Labelled: A dataset of 162 images with multiple labelled bounding boxes and one of seven discrete emotion labels per face. Focuses on group-level emotion localization in cluttered scenes: link
- Emotic: 23,571 images of people in natural, unconstrained environments, annotated with 26 categorical emotions and continuous Valence–Arousal–Dominance (VAD) scores, capturing contextual cues such as body language and scene type.
- FACES database: 72 images of naturalistic faces of young, middle-aged, and older women and men, each displaying one of six facial expressions: neutrality, sadness, disgust, fear, anger, and happiness. link
- The Real-world Affective Faces Database (RAF-DB): link
- YOLOv11m: A state-of-the-art object detection model, pretrained on the COCO dataset to detect 80 object categories. We fine-tuned this model on datasets (1) and (2); a fine-tuning sketch follows this list.
- YOLOv11m-face: Similar to the previous model, but pretrained specifically to detect faces in crowds. Once again, we fine-tuned this model on datasets (1) and (2).
- Two-Stage: This model first applies a face detection model, trained on datasets (1) and (2), to locate the faces in every picture; a classification model trained on datasets (3) and (4) is then applied to each cropped face. We have two versions of this model: the first uses OpenCV DNN SSD for face detection, while the second uses YOLOv11n-face. Both versions use ResNet18 for classification; a pipeline sketch follows this list.
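As a rough illustration of the single-stage approach, the sketch below shows fine-tuning and inference with the Ultralytics API. The dataset config `emotions.yaml` and the hyperparameters are illustrative assumptions, not the exact settings used in our runs.

```python
from ultralytics import YOLO

# Start from a pretrained checkpoint (yolo11m.pt, or the face variant for YOLOv11m-face).
model = YOLO("yolo11m.pt")

# Fine-tune on the combined group-emotion data; "emotions.yaml" is a hypothetical
# dataset config listing the train/val image folders and the seven emotion classes.
model.train(data="emotions.yaml", epochs=50, imgsz=640, batch=16)

# Inference on a group photo: each detected box carries an emotion class and confidence.
results = model("group_photo.jpg")
for box in results[0].boxes:
    print(int(box.cls), float(box.conf), box.xyxy[0].tolist())
```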
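The two-stage flow can be sketched as below using the YOLO-face detector variant and a ResNet18 classifier. The checkpoint paths mirror the repository layout; the label order, preprocessing, and the assumption that `best_overall.pt` is a plain state_dict are illustrative only.

```python
import torch
from PIL import Image
from torchvision import transforms
from torchvision.models import resnet18
from ultralytics import YOLO

# Assumed emotion label order; the actual order is defined by the training setup.
EMOTIONS = ["anger", "disgust", "fear", "happiness", "neutral", "sadness", "surprise"]

# Stage 1: face detection with the fine-tuned YOLO face detector.
detector = YOLO("app/two_step_model/BaselineModels/yolo11n-face-best.pt")

# Stage 2: ResNet18 emotion classifier fine-tuned on the single-face datasets.
classifier = resnet18(num_classes=len(EMOTIONS))
classifier.load_state_dict(
    torch.load("app/two_step_model/BaselineModels/best_overall.pt", map_location="cpu")
)
classifier.eval()

preprocess = transforms.Compose([
    transforms.Resize((224, 224)),  # assumed input size
    transforms.ToTensor(),
])

image = Image.open("group_photo.jpg").convert("RGB")
for box in detector(image)[0].boxes:
    x1, y1, x2, y2 = map(int, box.xyxy[0].tolist())  # face bounding box
    face = preprocess(image.crop((x1, y1, x2, y2))).unsqueeze(0)
    with torch.no_grad():
        label = EMOTIONS[classifier(face).argmax(dim=1).item()]
    print((x1, y1, x2, y2), label)
```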
We built an app using Streamlit where you can either upload or capture an image and choose one of our models. It outputs an annotated picture along with a DataFrame describing the detected bounding boxes and emotion labels; a minimal interface sketch follows.
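A minimal sketch of how such an interface can be wired up in Streamlit is shown below; `run_model` is a hypothetical helper standing in for the actual inference code in app/app.py, and the widget labels are illustrative.

```python
import streamlit as st
from PIL import Image

st.title("EmotionTeller")

# Pick one of the available models; the names mirror the models described above.
model_name = st.selectbox(
    "Model", ["YOLOv11m", "YOLOv11m-face", "Two-stage (SSD)", "Two-stage (YOLO-face)"]
)

# Either upload an image or capture one with the camera.
uploaded = st.file_uploader("Upload an image", type=["jpg", "jpeg", "png"])
captured = st.camera_input("...or take a photo")

source = uploaded or captured
if source is not None:
    image = Image.open(source).convert("RGB")
    # run_model is a hypothetical helper returning the annotated image and a
    # DataFrame of bounding boxes and emotion labels.
    annotated, faces_df = run_model(model_name, image)
    st.image(annotated, caption="Annotated output")
    st.dataframe(faces_df)
```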
EmotionTeller/
├── app/
│ ├── app.py # Streamlit interface for running models
│ ├── emotion_teller_demo.ipynb # Notebook version of the demo UI
│ ├── inputs/ # Latest inputs and outputs of the app
│ │ ├── annotated_output_image.png # Most recent annotated image
│ │ ├── faces_data.csv # Latest output dataframe
│ │ └── input_image.png # Latest input image to the model
│ ├── two_step_model/ # Two-stage detector + classifier package
│ │ ├── BaselineModels/ # Pretrained weights and configs
│ │ │ ├── best_overall.pt # ResNet18 classifier checkpoint
│ │ │ ├── deploy.prototxt # SSD face detector config
│ │ │ ├── res10_300x300_ssd_iter_140000.caffemodel # SSD detector weights
│ │ │ └── yolo11n-face-best.pt # YOLO-based face detector weights
│ │ ├── evaluation_pipeline_map.py # Evaluation helpers and metrics mapping
│ │ ├── two_step_pipeline.py # Orchestrates end-to-end pipeline
│ │ └── two_step_pipeline_example.md # Detailed docs for the two-step model
│ └── yolo_model/ # Fine-tuned YOLO single-stage models
│ ├── utils.py # Shared inference utilities
│ ├── yolov11m.py # Standard YOLOv11m wrapper
│ └── yolov11m_face.py # Face-focused YOLOv11m wrapper
├── Metadata/ # Dataset metadata tables
│ ├── test_meta.csv # Test split annotations
│ └── train_meta.csv # Training split annotations
├── YOLO_training/ # Training logs and outputs for YOLO
│ └── runs/ # Experiments, metrics, and weights
├── ClassificationBaseline_V3.ipynb # Emotion classification baseline notebook
├── DataSplit.ipynb # Data partitioning exploration
├── DetectionYOLO.ipynb # YOLO detection experiments
├── main.ipynb # Project summary and experiment notes
├── TwoStepWorkflow.ipynb # Two-stage pipeline prototyping
└── utilsJ.py # Shared helper utilities