This is a simple, easy-to-implement autoregressive Transformer model for sequence generation tasks such as SMILES generation. It is written entirely in PyTorch with minimal dependencies.
Features:
- Clean and lightweight implementation
- Supports autoregressive (causal) sequence modeling
- Includes loss computation and sampling functions
- Automatically selects a device (MPS, CUDA, or CPU), as sketched below
- Easy to integrate with any custom vocabulary
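The device auto-selection mentioned above might look like the following sketch (the helper name is assumed, not taken from this repository):

```python
import torch

def get_device():
    """Pick MPS, then CUDA, then CPU; the function name is hypothetical."""
    if torch.backends.mps.is_available():
        return torch.device("mps")
    if torch.cuda.is_available():
        return torch.device("cuda")
    return torch.device("cpu")
```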
The model uses:
- Token and positional embeddings
- A Transformer encoder with causal masking
- A linear output layer projecting to vocabulary logits
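A minimal sketch of this architecture, assuming learned positional embeddings and PyTorch's built-in encoder stack (the class name and hyperparameters are illustrative, not the repository's actual code):

```python
import torch
import torch.nn as nn

class SketchTransformer(nn.Module):
    # Illustrative hyperparameters; the real model may differ.
    def __init__(self, vocab_size, d_model=256, nhead=8, num_layers=4, max_len=512):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_model)   # token embeddings
        self.pos_emb = nn.Embedding(max_len, d_model)      # learned positional embeddings
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers)
        self.head = nn.Linear(d_model, vocab_size)         # projects to vocabulary logits

    def forward(self, x):
        # x: (batch, seq_len) integer token ids
        pos = torch.arange(x.size(1), device=x.device)
        h = self.tok_emb(x) + self.pos_emb(pos)
        # The causal mask prevents each position from attending to future tokens.
        mask = nn.Transformer.generate_square_subsequent_mask(x.size(1)).to(x.device)
        h = self.encoder(h, mask=mask)
        return self.head(h)  # (batch, seq_len, vocab_size) logits
```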
The TransformerModel class wraps this architecture and provides:
- `compute_loss()` for training
- `sample()` for autoregressive generation
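As a rough illustration (the actual signatures in this repository may differ), these two methods plausibly boil down to shifted next-token cross-entropy and token-by-token sampling:

```python
import torch
import torch.nn.functional as F

def compute_loss_sketch(model, tokens):
    """Next-token cross-entropy with a one-position shift (illustrative)."""
    logits = model(tokens[:, :-1])                    # predict token t+1 from the prefix up to t
    return F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),          # (batch*seq, vocab)
        tokens[:, 1:].reshape(-1),                    # targets shifted left by one
    )

@torch.no_grad()
def sample_sketch(model, start_token, max_len=64, temperature=1.0):
    """Autoregressive temperature sampling from a single start token (illustrative)."""
    tokens = torch.tensor([[start_token]])
    for _ in range(max_len - 1):
        logits = model(tokens)[0, -1] / temperature   # logits for the next token only
        next_tok = torch.multinomial(F.softmax(logits, dim=-1), 1)
        tokens = torch.cat([tokens, next_tok.view(1, 1)], dim=1)
    return tokens.squeeze(0).tolist()
```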
Requirements:
- Python 3.8+
- PyTorch
- RDKit (optional, for SMILES visualization)
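If RDKit is installed, a generated SMILES string can be rendered to an image; the string below is a placeholder for a decoded sample:

```python
from rdkit import Chem
from rdkit.Chem import Draw

smiles = "CCO"                    # placeholder; substitute a decoded sample
mol = Chem.MolFromSmiles(smiles)  # returns None if the SMILES is invalid
if mol is not None:
    Draw.MolToFile(mol, "molecule.png")
```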