OCR with Google's AI technology (Cloud Vision API)
Updated Feb 22, 2023 - Python
A lightweight and high-speed ComfyUI custom node for generating image captions using BLIP models. Optimized for both GPU and CPU environments to deliver fast and efficient caption generation.
This repo provides some Stable Diffusion experiments on the textual inversion and captioning tasks.
My solution to the Image Captioning Final Project of the Coursera "Introduction to Deep Learning" course with trained model deployed as telegram bot.
SmartMeat is a smart BBQ controllable from your phone
caption generator using lavis and argostranslate
Projects Collections
Transforms ordinary videos into ASCII style
ASCII Art Generator, locally, in your browser!
Welcome to 📜 Optical_character_recognition using Python 👋
This Python package allows you to access the img2txt.io API through a clean interface.
A custom framework for easy use of LLMs, VLMs, etc. supporting various modes and settings via web-ui
Using AI to write a caption for an image (with HuggingFace, Transformer.js or Azure Cognitive Vision API)
Printext is a lightweight application that extracts text from images.
Turn an image into an audio story (upload an image and let AI tell a story about it).