LiWeispace

LiWei LiWeispace

Stars

roboflow / rf-detr

RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.

Python 4,892 545 Updated Nov 13, 2025

snap-research / Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 655 25 Updated Oct 25, 2024

xuebinqin / U-2-Net

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Python 9,572 1,610 Updated Jun 26, 2024

yinyjin / DualAnoDiff

DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation

Python 119 9 Updated Jun 3, 2025

PKU-YuanGroup / Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 944 48 Updated Oct 16, 2024

PKU-YuanGroup / Video-LLaVA

[EMNLP 2024🔥] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,421 249 Updated Dec 3, 2024

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,075 1,088 Updated Nov 18, 2024

zai-org / CogVLM

A state-of-the-art-level open visual language model | multimodal pre-trained model

Python 6,710 449 Updated May 29, 2024

mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,477 126 Updated Aug 5, 2025

tianweiy / DMD2

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,134 54 Updated Mar 5, 2025

mseitzer / pytorch-fid

Compute FID scores with PyTorch.

Python 3,815 524 Updated Jul 3, 2024

ADL-X / LLAVIDAL

This is the official repository of LLAVIDAL

Python 22 5 Updated Oct 4, 2025

mingyuanzhou / SiD

PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)

Python 144 10 Updated Mar 29, 2025

FuNz-0 / One-for-More

The official implementation of “One-for-More: Continual Diffusion Model for Anomaly Detection” (CVPR 2025)

Python 52 4 Updated May 7, 2025

chlotmpo / PathCore_anomaly_detection

🔩 PatchCore - easier implementation of this image-level anomaly detector in python

Jupyter Notebook 13 4 Updated Jan 26, 2023

openai / consistency_models

Official repo for consistency models.

Python 6,459 437 Updated Mar 22, 2024

ModelTC / OmniBal

[ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance".

Python 26 3 Updated Jun 16, 2025

TRI-ML / prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 888 835 Updated Jul 4, 2024

ExplainableML / flair

[CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations

Python 125 6 Updated Sep 1, 2025

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 935 53 Updated Aug 5, 2025

IBM / row-column-intersection

This project makes available the code and data from our NAACL paper: "Capturing Row and Column Semantics in Transformer Based Question Answering over Tables"

Python 55 11 Updated Sep 17, 2025

facebookresearch / TaBERT

This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic p…

Python 606 67 Updated Aug 26, 2021

TRAILab / DINO_Teacher

Source code for "Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection"

Python 46 6 Updated May 18, 2025

salesforce / WikiSQL

A large annotated semantic parsing corpus for developing natural language interfaces.

HTML 1,794 328 Updated Oct 6, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,039 3,865 Updated Jul 23, 2024

HUST-SLOW / AnomalyNCD

[CVPR2025] AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios. Paper is available at https://arxiv.org/abs/2410.14379

Python 129 12 Updated Sep 1, 2025

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,105 1,146 Updated Dec 17, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,892 655 Updated Nov 20, 2025

fdjingliu / DMAD

The collection of diffusion models for anomaly detection, a survey paper submitted to IJCAI 2025.

65 5 Updated Jun 14, 2025

cnulab / RealNet

Official implementation of "RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection (CVPR 2024)"

Python 406 31 Updated Feb 12, 2025
