Soumya Shamarao Jahagirdar soumyasj

To all the books and reads for the greater good. PhD at Tubingen

20 followers · 24 following

Stars

PRIME-RL / TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 883 65 Updated Sep 26, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,994 2,402 Updated Nov 1, 2025

tulerfeng / Video-R1

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 726 38 Updated Sep 19, 2025

zli12321 / Vision-Language-Models-Overview

A collection and survey of frontier vision-language model papers and model GitHub repositories. Continuously updated.

420 21 Updated Oct 31, 2025

paulgavrikov / visualoverload

VisualOverload is a VQA benchmark for image understanding in dense, high-resolution scenes.

Python 14 Updated Oct 6, 2025

CSHaitao / Awesome-LLMs-as-Judges

The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.

479 22 Updated Jul 29, 2025

ibm-granite / granite-vision-models

Jupyter Notebook 31 6 Updated Jun 25, 2025

LLaVA-VL / LLaVA-NeXT

Python 4,355 413 Updated Sep 14, 2025

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

29,330 2,400 Updated Jun 18, 2024

umd-huang-lab / Mementos

Forked from si0wang/Mementos

Jupyter Notebook 31 Updated Feb 8, 2024

WalBouss / LeGrad

[ICCV25] Official Implementation of LeGrad

Python 82 8 Updated Oct 14, 2024

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,320 1,685 Updated Jul 2, 2025

JindongGu / Awesome-Prompting-on-Vision-Language-Model

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

497 38 Updated Mar 18, 2025

FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 580 44 Updated Jun 7, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,870 2,657 Updated Aug 12, 2024

Hon-Wong / ByteVideoLLM

[ICCV 2025] Dynamic-VLM

Python 26 Updated Dec 16, 2024

ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models

This repository collects all relevant resources about interpretability in LLMs

377 26 Updated Nov 1, 2024

inseq-team / inseq

Interpretability for sequence generation models 🐛 🔍

Python 444 38 Updated Oct 29, 2025

LambdaLabsML / distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

Python 526 50 Updated Oct 22, 2025

daixiangzi / Awesome-Token-Compress

A paper list of recent works on token compression for ViT and VLM

716 30 Updated Oct 21, 2025

coallaoh / Principles

218 6 Updated Nov 1, 2024

huggingface / evaluation-guidebook

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 1,732 95 Updated Oct 7, 2025

open-compass / VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,287 527 Updated Oct 31, 2025

tsb0601 / MMVP

Python 355 12 Updated Jan 27, 2024

WalBouss / MaskInversion

Python 26 1 Updated Oct 14, 2024

dali92002 / DocEnTR

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

Jupyter Notebook 176 36 Updated Jan 17, 2025

Yangyi-Chen / Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

741 42 Updated Oct 20, 2025

Ucas-HaoranWei / Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,874 144 Updated Dec 30, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

16,584 1,069 Updated Nov 1, 2025

SkalskiP / awesome-foundation-and-multimodal-models

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Python 636 45 Updated Feb 29, 2024