Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View ABaldrati's full-sized avatar

Organizations

@miccunifi

Block or report ABaldrati

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 1,624 80 Updated Jan 12, 2026

This repository contains code for the paper "Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training" by T. Bonnaire, R. Urfin, G. Biroli and M. Mézard.

Python 55 5 Updated Nov 27, 2025
5 Updated Dec 1, 2025

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 907 45 Updated Dec 23, 2025

Official inference repo for FLUX.2 models

Python 1,348 72 Updated Dec 1, 2025

Mitigating Negative Flips via Margin Preserving Training

Python 3 Updated Nov 16, 2025

This repository contains the official implementation code of NeurIPS 2025 paper: "Instance-Level Composed Image Retrieval".

Python 46 Updated Dec 22, 2025

[ICCV 2025] What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models

Python 12 Updated Nov 3, 2025

[TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Jupyter Notebook 147 4 Updated Nov 14, 2024

Fully Open Framework for Democratized Multimodal Training

Python 682 56 Updated Dec 27, 2025

Official PyTorch Implementation of "Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions". Accepted at EMNLP 2025

10 Updated Sep 19, 2025

Recurrence Meets Transformers for Universal Multimodal Retrieval

Python 13 Updated Dec 15, 2025

[IJCAI 2025] Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives

Python 28 1 Updated Nov 25, 2025

Official code for the paper "Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models" (ICLR 2025 Oral)

Python 21 Updated May 11, 2025
2 Updated Nov 26, 2025

[ICCV 2025] - Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution

Python 3 Updated Aug 16, 2025

[ICCV 2025] - Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution

Python 16 1 Updated Aug 16, 2025

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,427 51 Updated Jan 1, 2026

Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning

Python 258 13 Updated Sep 24, 2025

Multi-scale Image Super Resolution with a Single Auto-Regressive Model

12 Updated Jun 5, 2025

Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?

Jupyter Notebook 42 2 Updated Jul 26, 2025

Official implementation of the paper "Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals"

Python 30 4 Updated Jun 16, 2025

This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV 2025

Python 22 4 Updated Dec 4, 2025

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,509 75 Updated Mar 16, 2025

OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871

Jupyter Notebook 3,991 14 Updated Dec 2, 2025

[CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations

Python 129 6 Updated Sep 1, 2025

FRED: The Florence RGB-Event Drone Dataset

JavaScript 29 2 Updated Dec 31, 2025

This repo contains the official implementation of the paper "Attention, Please! Revisiting Attentive Probing Through the Lens of Efficiency"

Python 14 Updated Oct 7, 2025

[NeurIPS 2025 Spotlight] ReSim: Reliable World Simulation for Autonomous Driving

137 7 Updated Jan 2, 2026
Next