Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View heliossun's full-sized avatar

Block or report heliossun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] Official code for paper: Latent Chain-of-Thought for Visual Reasoning

Python 13 Updated Oct 16, 2025

Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"

Python 77 1 Updated Sep 10, 2025

[ICCV 2025] Official code for paper: Structured Policy Optimization: Enhance Large Vision-Language Model via Self-Referenced Dialogue

Python 2 Updated Oct 19, 2025

Code for visualizing the loss landscape of neural nets

Python 3,092 433 Updated Apr 5, 2022

A curated list of resources about generative flow networks (GFlowNets).

495 35 Updated Oct 1, 2024

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

208 8 Updated Oct 3, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,073 832 Updated Sep 4, 2025
Python 7 1 Updated May 25, 2024

Self-training LLaVA for medical

Python 16 1 Updated Nov 3, 2024

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,692 290 Updated Aug 14, 2024
Python 4,384 420 Updated Sep 14, 2025

A method to increase the speed and lower the memory footprint of existing vision transformers.

Python 1,117 80 Updated Jun 17, 2024

Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024]

Python 231 20 Updated Mar 23, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,474 31,128 Updated Nov 13, 2025

Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.

HTML 15,811 3,764 Updated Nov 3, 2025

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,465 12,468 Updated Nov 7, 2025

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,278 83 Updated Jul 14, 2024

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,279 426 Updated Nov 12, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,779 230 Updated Aug 11, 2024

PyTorch code for our CIKM 2022 paper "Calibrate Automated Graph Neural Network via Hyperparameter Uncertainty"

Python 2 Updated Oct 20, 2022

PyTorch code for ECCV 2022 Oral paper "Modeling Mask Uncertainty in Hyperspectral Image Reconstruction"

Python 26 2 Updated Jul 23, 2022

[ICCV2023 Official PyTorch code] for Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution

Python 28 1 Updated Mar 10, 2024

Generative Models by Stability AI

Python 26,596 2,979 Updated Nov 3, 2025

Robust vision-language understanding via evidential learning

Python 5 Updated Jul 10, 2024

Visual self-questioning for large vision-language assistant.

Python 45 2 Updated Jul 23, 2025

Fast Library for Approximate Nearest Neighbors

C++ 2,345 665 Updated Jul 29, 2024

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Python 837 43 Updated Aug 19, 2025

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 934 43 Updated Sep 27, 2024

🎓 无需编写任何代码即可轻松创建漂亮的学术网站 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.

TeX 4,664 6,478 Updated Nov 9, 2025

Emu Series: Generative Multimodal Models from BAAI

Python 1,755 86 Updated Sep 27, 2024
Next