Update TNT-(S/B) model weights and add feature extraction support #2480

brianhou0208 · 2025-05-03T07:40:28Z

The main updates are as follows:

Update the TNT-(S/B) weights to those of the official PyTorch implementation.
The existing implementation differs from the official one (TNT Block & PixelEmbed), so a legacy parameter is used to maintain backward compatibility.
Support the features_only parameter and the forward_intermediates function.

Example

import timm
import torch

# Load the updated weights
model = timm.create_model('tnt_s_patch16_224', pretrained=True)
model = timm.create_model('tnt_b_patch16_224', pretrained=True)

# Load an original checkpoint into the legacy architecture
model = timm.create_model('tnt_s_patch16_224', pretrained=False, legacy=True)
ckpt = torch.load('/path/to/original_weight.pth', map_location='cpu')
model.load_state_dict(ckpt)

Example (forward_intermediates)

model = timm.create_model('tnt_s_patch16_224', pretrained=True).eval()
output, intermediates = model.forward_intermediates(torch.randn(2, 3, 224, 224))
for i, o in enumerate(intermediates):
    print(f'Feat index: {i}, shape: {o.shape}')

Feat index: 0, shape: torch.Size([2, 384, 14, 14])
Feat index: 1, shape: torch.Size([2, 384, 14, 14])
Feat index: 2, shape: torch.Size([2, 384, 14, 14])
Feat index: 3, shape: torch.Size([2, 384, 14, 14])
Feat index: 4, shape: torch.Size([2, 384, 14, 14])
Feat index: 5, shape: torch.Size([2, 384, 14, 14])
Feat index: 6, shape: torch.Size([2, 384, 14, 14])
Feat index: 7, shape: torch.Size([2, 384, 14, 14])
Feat index: 8, shape: torch.Size([2, 384, 14, 14])
Feat index: 9, shape: torch.Size([2, 384, 14, 14])
Feat index: 10, shape: torch.Size([2, 384, 14, 14])
Feat index: 11, shape: torch.Size([2, 384, 14, 14])
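
The 14x14 spatial size in every shape above follows from the 224-pixel input and the 16-pixel patches, and 384 is the channel width printed for tnt_s. A minimal sketch of that arithmetic, using a hypothetical helper `expected_feature_shape` (not part of timm) and assuming square, non-overlapping patches:

```python
def expected_feature_shape(batch, channels, img_size, patch_size):
    """Return the (N, C, H, W) shape of a patch-grid feature map.

    Hypothetical helper for illustration only; it assumes square,
    non-overlapping patches and an input size divisible by the patch size.
    """
    if img_size % patch_size != 0:
        raise ValueError("img_size must be divisible by patch_size")
    grid = img_size // patch_size  # patches per side
    return (batch, channels, grid, grid)


shape = expected_feature_shape(batch=2, channels=384, img_size=224, patch_size=16)
print(shape)
```

The same ratio (224 / 14 = 16) is what a feature reduction of 16 expresses.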

Example (features_only)

model = timm.create_model('tnt_s_patch16_224', features_only=True, pretrained=True)
print(f'Feature channels: {model.feature_info.channels()}')
print(f'Feature reduction: {model.feature_info.reduction()}')
output = model(torch.randn(2, 3, 224, 224))
for x in output:
    print(x.shape)

Feature channels: [384, 384, 384]
Feature reduction: [16, 16, 16]
torch.Size([2, 384, 14, 14])
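
Three channel entries are reported here, while forward_intermediates above returns twelve block outputs. One plausible reading, stated as an assumption rather than something this PR confirms, is that the feature wrapper keeps the last three blocks by default. The hypothetical helper below (`resolve_indices`, not the timm implementation) sketches how a "last n" count or negative indices would resolve against 12 blocks:

```python
def resolve_indices(num_blocks, indices):
    """Map a 'last n' count or a sequence of (possibly negative) indices
    to absolute block indices.

    Hypothetical helper for illustration; the real selection logic lives
    in timm and may differ.
    """
    if isinstance(indices, int):
        if not 0 < indices <= num_blocks:
            raise ValueError("count must be in 1..num_blocks")
        # An integer is read as "take the last n blocks".
        return list(range(num_blocks - indices, num_blocks))
    resolved = []
    for i in indices:
        idx = i + num_blocks if i < 0 else i  # Python-style negative indexing
        if not 0 <= idx < num_blocks:
            raise IndexError(f"index {i} out of range for {num_blocks} blocks")
        resolved.append(idx)
    return resolved


last_three = resolve_indices(12, 3)
print(last_three)
```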

Result

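| Model | FLOPs | MACs | Params | ACC@1 | ACC@5 | ckpt |
| --- | --- | --- | --- | --- | --- | --- |
| tnt_b_patch16_224 | 28.11G | 14.06G | 65.43M | 82.872 | 96.224 | link |
| tnt_b_patch16_224 (original) | | | 65.41M | | | no weight link |
| tnt_s_patch16_224 | 10.44G | 5.22G | 23.77M | 81.526 | 95.760 | link |
| tnt_s_patch16_224 (original) | | | 23.76M | 81.528 | 95.734 | link |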
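
The numbers in the table can be cross-checked with a few lines of arithmetic. The sketch below copies the values from the table, confirms that MACs equal FLOPs / 2 (the convention used by the test code in this PR), and computes how far the updated tnt_s weights are from the original ones. The variable names are illustrative only.

```python
# Values copied from the table above (FLOPs and MACs in G).
rows = {
    "tnt_b_patch16_224": {"flops_g": 28.11, "macs_g": 14.06},
    "tnt_s_patch16_224": {"flops_g": 10.44, "macs_g": 5.22},
}

# MACs are reported as FLOPs / 2; allow for rounding to two decimals.
for name, r in rows.items():
    assert abs(r["flops_g"] / 2 - r["macs_g"]) < 0.01, name

# Updated vs. original tnt_s accuracy, in percentage points.
top1_delta = round(81.526 - 81.528, 3)
top5_delta = round(95.760 - 95.734, 3)
print(f"tnt_s top-1 delta: {top1_delta:+.3f}, top-5 delta: {top5_delta:+.3f}")
```

Both deltas are well under 0.05 percentage points, so the updated tnt_s weights match the original ones closely on this evaluation.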

test code

from typing import Any, Dict, Union, List
from tqdm import tqdm
import torch
from torch.utils.data import DataLoader
import torchvision.datasets as datasets
import torchvision.transforms as transforms
import timm
from timm.utils.metrics import AverageMeter, accuracy

device = torch.device('mps')
torch.mps.empty_cache()

def auto_unit(x: float, unit: str = '') -> str:
    if x >= 1e9:
        return f"{x / 1e9:.2f}G {unit}"
    elif x >= 1e6:
        return f"{x / 1e6:.2f}M {unit}"
    elif x >= 1e3:
        return f"{x / 1e3:.2f}K {unit}"
    else:
        return f"{x:.2f} {unit}"
 
 
def get_model_info(model: torch.nn.Module, imgsz: Union[int, List[int]] = 224) -> Dict[str, str]:
    """
    Compute model FLOPs, MACs, and Params using torch profiler.

    Args:
        model (nn.Module): The model to calculate for.
        imgsz (int | List[int], optional): Input image size. Defaults to 224.

    Returns:
        dict: Dictionary containing FLOPs, MACs, and Params with auto units.
    """
    p = next(model.parameters())
    if not isinstance(imgsz, list):
        imgsz = [imgsz, imgsz]

    im = torch.empty((1, 3, *imgsz), device=p.device)

    with torch.profiler.profile(with_flops=True) as prof:
        model(im)

    flops = sum(e.flops for e in prof.key_averages())
    macs = flops / 2
    params = sum(p.numel() for p in model.parameters())

    return {
        "FLOPs": auto_unit(flops, ""),
        "MACs": auto_unit(macs, ""),
        "Params": auto_unit(params, ""),
    }


def get_model_acc(model: torch.nn.Module):
    cfg: Dict[str, Any] = model.default_cfg
    _, height, width = cfg['input_size'] if 'test_input_size' not in cfg else cfg['test_input_size']
    crop_pct = cfg['crop_pct'] if 'test_crop_pct' not in cfg else cfg['test_crop_pct']
    imgsz = height if height == width else (height, width)
    interp_mode = {"nearest": 0, "bilinear": 2, "bicubic": 3}

    val_dataset = datasets.ImageFolder(
        './/imagenet/val',
        transforms.Compose([
            transforms.Resize(int(imgsz / crop_pct), interpolation=interp_mode[cfg['interpolation']]),
            transforms.CenterCrop(imgsz),
            transforms.ToTensor(),
            transforms.Normalize(cfg['mean'], cfg['std'])])
    )
    val_loader = DataLoader(
        val_dataset, batch_size=64, shuffle=False, pin_memory=False, prefetch_factor=4, num_workers=4,
        persistent_workers=True#, pin_memory_device='mps'
    )

    top1 = AverageMeter()
    top5 = AverageMeter()

    model.eval()
    model.to(device)
    torch.mps.synchronize()
    with torch.no_grad():
        for images, target in tqdm(val_loader):
            images = images.to(device)
            target = target.to(device)
            output = model(images)
            acc1, acc5 = accuracy(output, target, topk=(1, 5))
            top1.update(acc1, images.size(0))
            top5.update(acc5, images.size(0))
    torch.mps.synchronize()
    return {"ACC@1": round(top1.avg.item(), 4), "ACC@5": round(top5.avg.item(), 4)}
 
 
if __name__ == "__main__":
    model = timm.create_model('tnt_s_patch16_224', pretrained=True).eval()
    result = get_model_acc(model)
    print(result)
    model = timm.create_model('tnt_b_patch16_224', pretrained=True).eval()
    result = get_model_acc(model)
    print(result)

>>{'ACC@1': 81.526, 'ACC@5': 95.76}
>>{'ACC@1': 82.872, 'ACC@5': 96.224}
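
For readers unfamiliar with the metric, the following dependency-free sketch shows what a top-k accuracy and a batch-size-weighted running average compute. `topk_accuracy` and `RunningAverage` are simplified, hypothetical stand-ins written for illustration; they are not the timm `accuracy` and `AverageMeter` implementations used in the test code above.

```python
def topk_accuracy(scores, targets, topk=(1, 5)):
    """Return the percentage of samples whose target is among the k
    highest-scoring classes, for each k in topk."""
    results = []
    for k in topk:
        correct = 0
        for row, target in zip(scores, targets):
            # Class indices sorted by descending score.
            ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
            if target in ranked[:k]:
                correct += 1
        results.append(100.0 * correct / len(targets))
    return results


class RunningAverage:
    """Average weighted by sample count, so a smaller final batch
    does not skew the dataset-level result."""

    def __init__(self):
        self.total = 0.0
        self.count = 0

    def update(self, value, n=1):
        self.total += value * n
        self.count += n

    @property
    def avg(self):
        return self.total / self.count


# Toy batch: 3 samples, 3 classes.
scores = [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2], [0.2, 0.3, 0.5]]
targets = [1, 1, 2]
acc1, acc2 = topk_accuracy(scores, targets, topk=(1, 2))

meter = RunningAverage()
meter.update(50.0, n=64)   # a full batch at 50% accuracy
meter.update(100.0, n=32)  # a smaller batch at 100% accuracy
print(acc1, acc2, meter.avg)
```

Weighting each update by `images.size(0)`, as the test code does, matters whenever the dataset size is not a multiple of the batch size.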

Reference

Official PyTorch implementation: https://github.com/huawei-noah/Efficient-AI-Backbones/tree/master/tnt_pytorch

HuggingFaceDocBuilderDev · 2025-05-05T18:56:48Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

brianhou0208 added 4 commits May 2, 2025 20:34

Update tnt.py

b37f0f7

Support features_only

848b8c3

Fix default_cfgs

fc0b6ad

Fix checkpoint_filter_fn

37bbac1

brianhou0208 commented May 6, 2025

timm/models/tnt.py (resolved)

brianhou0208 and others added 3 commits May 11, 2025 22:45

Merge branch 'main' into tnt

69b1fbc

Updated tnt model weights on hub, add back legacy model in case bwd compat

74ad32a

Fix torchscript issue with legacy tnt

16d0b26

rwightman merged commit 6b302f2 into huggingface:main May 14, 2025
22 checks passed

brianhou0208 deleted the tnt branch May 15, 2025 17:15
