Stars
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
A lightweight and robust Python eye tracker
A feature-rich command-line audio/video downloader
[.NET] m3u8 downloader 开源的命令行m3u8/HLS/dash下载器,支持普通AES-128-CBC解密,多线程,自定义请求头等. 支持简体中文,繁体中文和英文. English Supported.
Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation
LongShortNet for Streaming Perception task.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model. IEEE Transactions on Image Processing (2018)
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
谷歌翻译服务器在中国大陆的IP地址扫描、测速工具。
Predicting and estimating camera motion in endoscopic interventions
[MedIA2022 & ICRA2021] Self-Supervised Monocular Depth and Ego-Motion Estimation in Endoscopy: Appearance Flow to the Rescue
Hybrid Neural Fusion for Full-frame Video Stabilization
This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment" and our MICCAI 2023 paper "Encoding Surgical Videos as Spa…
Official implementation of paper [DeepTag: A General Framework for Fiducial Marker Design and Detection]
Affine and Regularized DEformative Numeric Transform (ardent) is a Python package for performing image registration using LDDMM.
Show dependencies tree for .NET assemblies like old Depend Walker show it for non managed applications.
a project for developing registration tools with convolutional neural networks
Image registration laboratory for 2D and 3D image data
An optical flow forward warp's lib with backpropagation using pytorch.
Implementation for Residual Registration Network (R2Net)
深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
Pytorch implementation of Swin MAE https://arxiv.org/abs/2212.13805