Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View atakansite's full-sized avatar
  • Istanbul Technical University
  • Istanbul
  • 00:45 (UTC +03:00)

Block or report atakansite

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CulturalVLM Project

Python 1 Updated Sep 23, 2025

An implementation of the recently introduced Tversky Neural Networks

Jupyter Notebook 4 Updated Aug 8, 2025

The official dataset of the flowvqa project.

17 2 Updated Mar 26, 2024

An open source implementation of CLIP.

Python 12,834 1,188 Updated Sep 21, 2025

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Python 286 19 Updated Jun 7, 2023
Python 1 Updated Jul 24, 2025

A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.

414 21 Updated Oct 21, 2025

PyTorch implementation of VQ-VAE by Aäron van den Oord et al.

Jupyter Notebook 592 104 Updated Nov 13, 2019

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 1,006 64 Updated Apr 25, 2025
15 Updated Jul 29, 2024
Python 1 Updated Jun 18, 2025

The official repo of "On the Perception Bottleneck of VLMs for Chart Understanding"

Jupyter Notebook 8 Updated Apr 12, 2025
2 Updated May 22, 2025

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Python 769 104 Updated Jul 22, 2025

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,641 217 Updated Oct 27, 2025

[NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding

Python 17 3 Updated Aug 23, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,791 2,663 Updated Jul 3, 2025
17 Updated Jun 12, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,309 1,684 Updated Jul 2, 2025

An easy way to extract information from documents

Python 1,781 131 Updated May 3, 2023

A curated list of resources for Document Understanding (DU) topic

1,470 165 Updated Jun 2, 2023

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 4,894 708 Updated Oct 28, 2025

A collection of AWESOME language modeling techniques on tabular data applications.

32 1 Updated Oct 14, 2024

We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理

568 41 Updated Sep 9, 2025

MEG: Medical Knowledge-Augmented Large Language Models for Question Answering

Python 9 1 Updated Nov 15, 2024

🦜💯 Flex those feathers!

Python 252 51 Updated Oct 21, 2024

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,644 1,220 Updated Oct 27, 2025

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 8,998 820 Updated Jul 20, 2025

🦜🔗 Build context-aware reasoning applications

Python 118,289 19,478 Updated Oct 28, 2025
Next