Stars
Learn and understand Docker & Container technologies, with real DevOps practice!
STB-VMM: Swin Transformer Based Video Motion Magnification (official repository)
An unofficial implementation of "Learning-based Video Motion Magnification" in PyTorch.
Learn from books on TCP/IP | HTTP(s) | HTML, CSS, JS, jQuery | Vue | PHP | Web | Web Server
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A collection of free LLM (large language model) APIs (GPT, Claude, BingCopilot, Llama, Gemini)
A widget for Trilium Notes to preview LaTeX notes.
Build your personal knowledge base with Trilium Notes
[CVPR 2025 Highlight] PyTorch implementation of "Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition"
The PyTorch code and model for "Learn an Effective Lip Reading Model without Pains" (https://arxiv.org/abs/2011.07557), which reaches state-of-the-art performance on the LRW-1000 dataset.
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
Official Implementation of Visual Transformer Pooling for Lip reading
[BMVC 2023] De-identification of facial videos while preserving remote physiological utility
[CVPR 2024] EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing
[NeurIPS 2024 Spotlight] Official implementation of MSFA and release of SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR) Object Detection
A Synchronously Collected External and Internal Fingerprint Database
A trusty face analysis research platform developed by Tencent Youtu Lab
Visual Speech Recognition for Multiple Languages
Simple Motion Capture based on gyro MPU-9250 and ESP32 in Unreal Engine
I2C device library collection for AVR/Arduino or other C++-based MCUs