Qiukunpeng


Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 300 10 Updated Oct 16, 2025

[NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Python 60 2 Updated Oct 29, 2025
Python 23 2 Updated Oct 20, 2025

[ICCV'25 Highlight] Derm1M: A Million‑Scale Vision‑Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology

Python 42 3 Updated Oct 15, 2025

[MICCAI'25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment

Python 11 Updated Oct 26, 2025

Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision

Python 166 3 Updated Sep 25, 2025

Open-source unified multimodal model

Python 5,258 455 Updated Oct 27, 2025

Building an MoE large language model as an individual: a complete walkthrough from pretraining to DPO

Python 1,775 139 Updated Nov 5, 2025

[CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation

Python 70 3 Updated Sep 11, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,673 250 Updated Sep 25, 2025

[EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning

Python 29 Updated Oct 22, 2025

【CVPR 2025 Highlight】MonSter: Marry Monodepth to Stereo Unleashes Power

Python 689 51 Updated Oct 31, 2025

Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences (ICML 2025)

Python 25 2 Updated Jun 29, 2025

InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment (CVPR 2025 Highlight)

Python 39 1 Updated Jun 29, 2025

Official Repo of "Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective" (IEEE TMI 2025)

Python 14 1 Updated Nov 7, 2025

A paper list for medical anomaly detection. ℱℯℯ𝓁 𝒻𝓇ℯℯ to contribute!

47 2 Updated Oct 14, 2025
Python 53 2 Updated Jul 1, 2025

[ICLR 2025] NextBestPath: Efficient 3D Mapping of Unseen Environments

Python 63 2 Updated Nov 8, 2025

Official implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)

Python 73 2 Updated Jul 26, 2025

[arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

Python 68 2 Updated Nov 8, 2025

[TMLR 2025] Efficient Reasoning Models: A Survey

Python 275 18 Updated Oct 30, 2025

[AAAI'25] DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis

Python 100 4 Updated Apr 16, 2025

【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overcoming the limitations of traditional fine-tuning methods that …

Python 46 4 Updated Apr 2, 2025

[ICCV2025] PyTorch implementation of "Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models"

Python 108 4 Updated Jul 25, 2025

Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"

Python 30 Updated Jul 25, 2025

[MICCAI 2025] Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image

Python 12 1 Updated Sep 11, 2025

[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"

Python 435 17 Updated Sep 28, 2025

[JAG 2022] Multitask consistency network with single temporal supervision for semi-supervised building change detection

Python 19 1 Updated Aug 25, 2024

[TGRS 2024] CutMix-CD: Advancing Semi-Supervised Change Detection via Mixed Sample Consistency

Python 20 Updated Dec 21, 2024

[ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

Python 33 Updated Aug 12, 2025