Translution: Unifying Self-attention and Convolution for Adaptive and Relative Modeling

⚠️ Please note that a full Translution Neural Network requires a large amount of GPU memory—beyond what most current devices can provide. However, you can replace individual Self-Attention layers with Translution in Transformers, which may yield surprisingly performance improvements.

Code Index

Image (2D): • Translution • LoR-Translution
Language (1D): • Translution • LoR-Translution

Abstract

When modeling a given type of data, we consider it to involve two key aspects: 1) identifying relevant elements (e.g., image pixels or textual words) to a central element, as in a convolutional receptive field, or to a query element, as in self-attention, and 2) encoding these tokens effectively. Self-attention can adaptively identify these elements but relies on absolute positional embedding for structural representation learning. In contrast, convolution encodes elements in a relative manner, yet their fixed kernel size limits their ability to adaptively select the relevant elements. Translution unifies the adaptive identification capability of self-attention and the relative encoding advantage of convolution.

Related Repos

ViT: https://github.com/lucidrains/vit-pytorch
nanoGPT: https://github.com/karpathy/nanoGPT

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
GPT		GPT
ViT		ViT
imgs		imgs
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Translution: Unifying Self-attention and Convolution for Adaptive and Relative Modeling

Code Index

Abstract

Related Repos

About

Uh oh!

Releases

Packages

Languages

License

hehefan/Translution

Folders and files

Latest commit

History

Repository files navigation

Translution: Unifying Self-attention and Convolution for Adaptive and Relative Modeling

Code Index

Abstract

Related Repos

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages