vit-gpt2

Here are 7 public repositories matching this topic...

Divy005 / image_caption_generator

AI-powered image captioning using InceptionV3+LSTM and ViT-GPT2 models. Trained on the Flickr8k dataset, with an interactive Streamlit interface.

image-captioning nlp-machine-learning keras-tensorflow depplearning computer-vison flicker8k-dataset streamlit-webapp vit-gpt2

Updated Oct 27, 2025
Jupyter Notebook

armanjscript / Argonz-Image-Captioning-Extension

A Chrome extension that takes input images and generates captions for them.

nodejs chrome-extension webpack postcss image-captioning tailwindcss image-caption-generator xenova-transformers vit-gpt2

Updated Dec 5, 2024
JavaScript

ramyacp14 / Image-Caption-Generator

Developed an image captioning system using the BLIP model to generate detailed, context-aware captions. Achieved an average BLEU score of 0.72, providing rich descriptions that enhance accessibility and inclusivity.

machine-learning tensorflow imagenet blip coco-dataset cnn-rnn vision-transformer vit-gpt2

Updated Sep 6, 2024
Jupyter Notebook

ChaituRajSagar / video_to_narrative

Flask-based AI app that summarizes surveillance videos using Whisper (audio), ViT-GPT2 (frame captions), and Groq LLM (narratives). Produces both general and law enforcement-style summaries.

python opencv flask ffmpeg image-captioning whisper groq llm openai-whisper generative-ai vit-gpt2 video-summary law-enforcement-ai surveillance-ai bodycam-analysis

Updated Jul 14, 2025
Python

PrachiPatel15 / Multimodal-Visual-AI-Chatbot

A powerful Streamlit application that analyzes images using multiple vision models and responds to queries about visual content through conversational AI.

blip conversational-ai multimodal-large-language-models vit-gpt2

Updated Feb 26, 2025
Python

PrachiPatel15 / AI-Image-Captioning

An AI-powered image captioning app built with Streamlit, using ViT-GPT2 for caption generation and YOLOv8 for object detection. The app provides enhanced captions by integrating detected objects into the generated text.

computer-vision image-processing transformers streamlit yolov8 vit-gpt2

Updated Feb 21, 2025
Python

abdeldayem02 / Detect-and-Describe

image-captioning object-detection yolov5 vit-gpt2

Updated Oct 29, 2024
HTML

