TransNet: A deep network for fast detection of common shot transitions

Jupyter Notebook 60 13 Updated Jun 8, 2020

TransNet V2: Shot Boundary Detection Neural Network

Python 826 130 Updated Dec 4, 2023

AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023

Python 207 19 Updated Apr 18, 2023

Large-scale, Fast and Accurate Shot Boundary Detection through Spatio-temporal Convolutional Neural Networks

MATLAB 70 9 Updated Oct 9, 2020

ClipShots is the first large-scale dataset for shot boundary detection, collected from YouTube and Weibo and covering more than 20 categories, including sports, TV shows, animals, etc.

Python 122 16 Updated Nov 9, 2021

Official implementation of "Implicit Neural Representations with Periodic Activation Functions"

Python 1,934 267 Updated Jul 27, 2024

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,876 1,335 Updated Aug 14, 2024

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,126 626 Updated Aug 9, 2023

Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures

Python 124 11 Updated Dec 8, 2025

A collection of loss functions for medical image segmentation

Python 3,989 614 Updated Nov 1, 2023

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Python 9,613 1,608 Updated Jun 26, 2024

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Python 5,963 1,078 Updated Jun 19, 2024

The HierText dataset contains ~12k images from the Open Images dataset v6 with a large number of text entities. We provide word-, line- and paragraph-level annotations.

Jupyter Notebook 301 28 Updated Dec 2, 2024

This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing"

Python 54 2 Updated Nov 28, 2022

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,074 139 Updated Dec 18, 2025

publications, experiments, reports, etc.

Python 1 1 Updated May 18, 2022

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,186 365 Updated Nov 11, 2025

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 1,074 44 Updated Jan 21, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,546 984 Updated Aug 12, 2024

(Pattern Recognition) PyTorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”

Python 114 16 Updated Oct 24, 2025

[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective

Python 200 9 Updated Nov 1, 2023

Papers, datasets, algorithms, and SOTA for STR. Maintained long-term.

Python 352 38 Updated Nov 29, 2023

A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".

Python 485 122 Updated Jul 2, 2021

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

Python 194 23 Updated Oct 13, 2025

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

Jupyter Notebook 457 75 Updated Oct 14, 2022

An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".

Python 146 20 Updated Nov 14, 2025

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,210 1,153 Updated Dec 22, 2025