Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View LiWeispace's full-sized avatar

Block or report LiWeispace

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.

Python 4,892 545 Updated Nov 13, 2025

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 655 25 Updated Oct 25, 2024

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Python 9,572 1,610 Updated Jun 26, 2024

DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation

Python 119 9 Updated Jun 3, 2025

[CVPR 2024 HighlightšŸ”„] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 944 48 Updated Oct 16, 2024

怐EMNLP 2024šŸ”„ć€‘Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,421 249 Updated Dec 3, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,075 1,088 Updated Nov 18, 2024

a state-of-the-art-level open visual language model | å¤šęØ”ę€é¢„č®­ē»ƒęØ”åž‹

Python 6,710 449 Updated May 29, 2024

[ACL 2024 šŸ”„] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,477 126 Updated Aug 5, 2025

(NeurIPS 2024 Oral šŸ”„) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,134 54 Updated Mar 5, 2025

Compute FID scores with PyTorch.

Python 3,815 524 Updated Jul 3, 2024

This is the offical repository of LLAVIDAL

Python 22 5 Updated Oct 4, 2025

PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)

Python 144 10 Updated Mar 29, 2025

The official implementation of ā€œOne-for-More: Continual Diffusion Model for Anomaly Detectionā€ (CVPR2025)

Python 52 4 Updated May 7, 2025

šŸ”© PatchCore - easier implementation of this image-level anomaly detector in python

Jupyter Notebook 13 4 Updated Jan 26, 2023

Official repo for consistency models.

Python 6,459 437 Updated Mar 22, 2024

[ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance".

Python 26 3 Updated Jun 16, 2025

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 888 835 Updated Jul 4, 2024

[CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations

Python 125 6 Updated Sep 1, 2025

[CVPR 2024 šŸ”„] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 935 53 Updated Aug 5, 2025

This project makes available the code and data from our NAACL paper: "Capturing Row and Column Semantics in Transformer Based Question Answering over Tables"

Python 55 11 Updated Sep 17, 2025

This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic p…

Python 606 67 Updated Aug 26, 2021

Source code for "Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection"

Python 46 6 Updated May 18, 2025

A large annotated semantic parsing corpus for developing natural language interfaces.

HTML 1,794 328 Updated Oct 6, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,039 3,865 Updated Jul 23, 2024

[CVPR2025] AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios. Paper is available at https://arxiv.org/abs/2410.14379

Python 129 12 Updated Sep 1, 2025

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,105 1,146 Updated Dec 17, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,892 655 Updated Nov 20, 2025

The collection of diffusion models for anomaly detection, a survey paper submitted to IJCAI 2025.

65 5 Updated Jun 14, 2025

Offical implementation of "RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection (CVPR 2024)"

Python 406 31 Updated Feb 12, 2025
Next