-
School of Software @ THU -> Multimedia Lab @ CUHK -> KlingAI @ Kuaishou
-
HPSv2 Public
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
-
wikiscenes Public
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.
-
VideoFlow Public
Forked from XiaoyuShi97/VideoFlowOfficial implementation of ICCV2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation
-
-
-
-
ICCV2023-Diffusion-Papers Public
Forked from Sierkinhane/ICCV2023-Diffusion-PapersICCV2023-Diffusion-Papers
1 UpdatedAug 2, 2023 -
align_sd Public
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
-
CORA Public
A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023
-
-
diffusers Public
Forked from ShivamShrirao/diffusers🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Python Apache License 2.0 UpdatedMar 26, 2023 -
RegionCLIP Public
Forked from microsoft/RegionCLIP[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
Python Apache License 2.0 UpdatedJan 10, 2023 -
ast Public
Forked from YuanGongND/astCode for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 6, 2021 -
ViViT-pytorch Public
Forked from rishikksh20/ViViT-pytorchImplementation of ViViT: A Video Vision Transformer
Python MIT License UpdatedJun 21, 2021 -
EuclideanMST Public
Forked from AndrewB330/EuclideanMSTImplementations of different algorithms for building Euclidean minimum spanning tree in k-dimensional space.
C++ MIT License UpdatedJun 21, 2021 -
mmdetection Public
Forked from open-mmlab/mmdetectionOpenMMLab Detection Toolbox and Benchmark
Python Apache License 2.0 UpdatedFeb 26, 2021 -
1-stage-wseg Public
Forked from visinf/1-stage-wsegSingle-Stage Semantic Segmentation from Image Labels (CVPR 2020)
Python Apache License 2.0 UpdatedSep 29, 2020 -
DataBase2020 Public
Forked from ArthurCChen/DataBase2020清华大学数据库原理课程大作业(框架为4.23助教更新后版本)开发者:武笑石、黎思宇、陈语凝
Java UpdatedJun 24, 2020 -
银行精准营销解决方案+青蛙叫声聚类分析
-
magic-ruler-simulator Public
A graphical simulator for magic ruler. You can create, operate, save, load your magic ruler, and watch it from different views.
C UpdatedJan 30, 2020 -
-
-
-
-
Front-end-Homework Public
Forked from chuchong/BigHomefront-end final work by cocosStudio
-