-
Johns Hopkins University
- Baltimore, MD
- https://wufeim.github.io
Highlights
- Pro
-
SpatialReasonerDataGen Public
Synthetic VQA data generation code for SpatialReasoner.
-
-
-
LVSM Public
Forked from Haian-Jin/LVSM[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"
Python Other UpdatedAug 11, 2025 -
tok1d Public
Python Packaging for 1D Visual Tokenization and Generation.
-
-
VLMEvalKit Public
Forked from open-compass/VLMEvalKitOpen-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Python Apache License 2.0 UpdatedJun 8, 2025 -
awesome-3d-spatial-reasoning Public
A curated list of datasets, papers, and codebases on 3D spatial reasoning.
5 UpdatedApr 26, 2025 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedApr 1, 2025 -
lmms-eval Public
Forked from EvolvingLMMs-Lab/lmms-evalAccelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Python Other UpdatedMar 29, 2025 -
-
-
imagenet3d Public
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
-
-
SpatialRGPT Public
Forked from AnjieCheng/SpatialRGPT[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"
Python Apache License 2.0 UpdatedOct 17, 2024 -
infinigen Public
Forked from princeton-vl/infinigenInfinite Photorealistic Worlds using Procedural Generation
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 10, 2024 -
DST3D Public
Official implementation of "Generating images with 3D annotations using diffusion models".
-
-
-
-
imagenet3d_exp Public
Code to reproduce baseline results on ImageNet3D.
-
-
open_clip Public
Forked from mlfoundations/open_clipAn open source implementation of CLIP.
Jupyter Notebook Other UpdatedApr 8, 2024 -
-
omni3d Public
Forked from facebookresearch/omni3dCode release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"
Python Other UpdatedNov 17, 2023 -
ConvNeXt Public
Forked from facebookresearch/ConvNeXtCode release for ConvNeXt model
Python MIT License UpdatedOct 2, 2023 -
mae Public
Forked from facebookresearch/maePyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Python Other UpdatedSep 26, 2023 -
NeMo Public
Neural mesh models for 3D reasoning.
-
-
Grounded-Segment-Anything Public
Forked from IDEA-Research/Grounded-Segment-AnythingGrounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Jupyter Notebook Apache License 2.0 UpdatedJul 25, 2023