A collection of Deep Learning models and experiments.
This repository contains:
- Custom implementations of various architectures.
- Training pipelines for different modalities.
- Docker scripts for reproducible experiments.
Dynamic Tanh (DyT): a custom layer from "Transformers without Normalization" that applies an element-wise, parameterized tanh function. It can be used as a drop-in replacement for normalization layers such as LayerNorm.
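The core idea can be sketched in a few lines of PyTorch. This is a minimal illustration of the DyT formulation (`gamma * tanh(alpha * x) + beta`), not the repository's actual module; the class name and the `alpha_init` default are assumptions:

```python
import torch
import torch.nn as nn

class DyT(nn.Module):
    """Dynamic Tanh: gamma * tanh(alpha * x) + beta, applied element-wise.
    Illustrative sketch; alpha_init value is an assumption."""
    def __init__(self, dim: int, alpha_init: float = 0.5):
        super().__init__()
        self.alpha = nn.Parameter(torch.full((1,), alpha_init))  # learnable scalar
        self.gamma = nn.Parameter(torch.ones(dim))               # per-channel scale
        self.beta = nn.Parameter(torch.zeros(dim))               # per-channel shift

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.gamma * torch.tanh(self.alpha * x) + self.beta

layer = DyT(8)
x = torch.randn(2, 4, 8)
y = layer(x)  # same shape as x; no mean/variance statistics are computed
```

Unlike LayerNorm, no per-token statistics are computed, which is what makes the layer attractive from a runtime perspective.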
A minimal implementation of the Vision Transformer (ViT) architecture as described in the "An Image is Worth 16x16 Words" paper.
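The paper's first step, turning an image into a sequence of patch embeddings, can be sketched with a strided convolution. This is an illustrative snippet, not the repository's implementation; the class name and dimensions are assumptions:

```python
import torch
import torch.nn as nn

class PatchEmbed(nn.Module):
    """Split an image into non-overlapping patches and linearly embed each one.
    Sketch only; names and default sizes are assumptions."""
    def __init__(self, patch_size=16, in_chans=3, dim=64):
        super().__init__()
        # A convolution with kernel == stride == patch_size extracts and
        # projects each patch in a single operation.
        self.proj = nn.Conv2d(in_chans, dim, kernel_size=patch_size, stride=patch_size)

    def forward(self, x):
        x = self.proj(x)                     # (B, dim, H/ps, W/ps)
        return x.flatten(2).transpose(1, 2)  # (B, num_patches, dim)

tokens = PatchEmbed()(torch.randn(1, 3, 64, 64))  # 64/16 = 4 -> 4*4 = 16 patches
```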
A U-Net-based model using 3D convolutions for regression on spatiotemporal data. It follows the implementation proposed in "3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation" and adds residual connections in the encoder and decoder blocks.
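A residual 3D convolution block of the kind described above could look roughly like this. It is a sketch under stated assumptions (class name, channel counts, and the 1x1x1 skip projection are illustrative), not the repository's code:

```python
import torch
import torch.nn as nn

class ResBlock3D(nn.Module):
    """Two 3D convolutions with a residual (skip) connection.
    Illustrative sketch of an encoder/decoder block with a residual path."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.conv1 = nn.Conv3d(in_ch, out_ch, kernel_size=3, padding=1)
        self.conv2 = nn.Conv3d(out_ch, out_ch, kernel_size=3, padding=1)
        # Project the input when channel counts differ so the addition is valid.
        self.skip = nn.Conv3d(in_ch, out_ch, kernel_size=1) if in_ch != out_ch else nn.Identity()
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        h = self.act(self.conv1(x))
        h = self.conv2(h)
        return self.act(h + self.skip(x))

# Input shaped (batch, channels, depth/time, height, width)
out = ResBlock3D(1, 16)(torch.randn(1, 1, 8, 16, 16))
```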
LayerNorm vs Dynamic Tanh (DyT) Normalization in Small Vision Transformers: a study of the loss and accuracy dynamics when training small ViT models from scratch on tiny-imagenet-200 with different normalization approaches. It also includes a brief runtime analysis comparing RMSNorm, LayerNorm, and DyT.
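A runtime comparison of this kind can be sketched with a simple CPU microbenchmark. The `RMSNorm` and `DyT` classes below are minimal stand-ins written for illustration (they are not the repository's implementations), and the tensor shape mimics a small ViT's token sequence:

```python
import time
import torch
import torch.nn as nn

class RMSNorm(nn.Module):
    """Minimal RMSNorm sketch: scale by the root-mean-square over the last dim."""
    def __init__(self, dim, eps=1e-6):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(dim))
        self.eps = eps

    def forward(self, x):
        rms = torch.sqrt(x.pow(2).mean(dim=-1, keepdim=True) + self.eps)
        return self.weight * x / rms

class DyT(nn.Module):
    """Minimal DyT sketch: gamma * tanh(alpha * x) + beta."""
    def __init__(self, dim):
        super().__init__()
        self.alpha = nn.Parameter(torch.full((1,), 0.5))
        self.gamma = nn.Parameter(torch.ones(dim))
        self.beta = nn.Parameter(torch.zeros(dim))

    def forward(self, x):
        return self.gamma * torch.tanh(self.alpha * x) + self.beta

def bench(layer, x, iters=50):
    """Average forward-pass wall time over `iters` runs, after one warm-up."""
    with torch.no_grad():
        layer(x)  # warm-up
        t0 = time.perf_counter()
        for _ in range(iters):
            layer(x)
    return (time.perf_counter() - t0) / iters

x = torch.randn(32, 197, 384)  # (batch, tokens, dim) for a small ViT
results = {name: bench(layer, x)
           for name, layer in [("RMSNorm", RMSNorm(384)),
                               ("LayerNorm", nn.LayerNorm(384)),
                               ("DyT", DyT(384))]}
for name, t in results.items():
    print(f"{name}: {t * 1e3:.3f} ms/forward")
```

Absolute numbers depend heavily on hardware and tensor shape; the study in this repository should be consulted for the actual measurements.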
```
zoo/
├── configs/          # Training configuration files
├── data_utils/       # Data loading and augmentation utilities
├── models/           # Model and layer implementations
├── docker-build.sh   # Script to build the Docker image
├── docker-run.sh     # Script to run the Docker container
├── Dockerfile        # Docker configuration
├── train_*.py        # Training script(s)
├── utils.py          # Utility functions
└── vis.py            # Visualization functions
```
This project is licensed under the MIT License; see the LICENSE file for details.