A lightweight and high-speed ComfyUI custom node for generating image captions using BLIP models. Optimized for both GPU and CPU environments to deliver fast and efficient caption generation.
- Generate captions for images using BLIP models
- Support for both base and large BLIP models
- Simple and advanced captioning options
- Automatic model downloading and caching
- High performance on both GPU and CPU
- Navigate to your ComfyUI custom nodes directory:
```bash
cd ComfyUI/custom_nodes/
```
- Clone this repository:
```bash
git clone https://github.com/1038lab/ComfyUI-Blip.git
```
- Install required dependencies:
```bash
pip install -r requirements.txt
```

If automatic download fails, you can manually download the models:
- Base model:
https://huggingface.co/Salesforce/blip-image-captioning-base/tree/main
- Large model:
https://huggingface.co/Salesforce/blip-image-captioning-large/tree/main
Download the following files and place them in the corresponding directories:
- pytorch_model.bin
- config.json
- preprocessor_config.json
- special_tokens_map.json
- tokenizer_config.json
- tokenizer.json
- vocab.txt
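If you prefer to script the manual download, a minimal sketch using the `huggingface_hub` library is shown below. The `local_dir` path is an assumption and should be replaced with whatever directory this node actually loads models from.

```python
# Hedged sketch: fetch the required BLIP checkpoint files with huggingface_hub.
# The local_dir path is an assumption -- point it at the directory this node
# expects to find its models in.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="Salesforce/blip-image-captioning-base",  # or blip-image-captioning-large
    local_dir="ComfyUI/models/blip/blip-image-captioning-base",  # assumed layout
    allow_patterns=[
        "pytorch_model.bin", "config.json", "preprocessor_config.json",
        "special_tokens_map.json", "tokenizer_config.json",
        "tokenizer.json", "vocab.txt",
    ],
)
```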
- Add the "Blip Caption" node to your workflow
- Connect an image input to the node
- Configure the following parameters:
- model_name: Choose between the base (faster) or large (more detailed) BLIP model
- max_length: Maximum length of the generated caption (1-100)
- use_nucleus_sampling: Enable for more creative captions
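For reference, these parameters correspond closely to a standard Hugging Face `transformers` BLIP captioning call. The sketch below is illustrative only (it is not this node's actual code) and assumes the `transformers` and `Pillow` packages plus a placeholder image path:

```python
# Illustrative sketch of how the basic parameters map onto the transformers BLIP API.
# This is not the node's implementation; "example.jpg" is a placeholder path.
import torch
from PIL import Image
from transformers import BlipProcessor, BlipForConditionalGeneration

model_name = "Salesforce/blip-image-captioning-base"   # base (faster) or large (more detailed)
device = "cuda" if torch.cuda.is_available() else "cpu"

processor = BlipProcessor.from_pretrained(model_name)
model = BlipForConditionalGeneration.from_pretrained(model_name).to(device)

image = Image.open("example.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt").to(device)

output_ids = model.generate(
    **inputs,
    max_length=50,    # max_length: cap on caption length
    do_sample=True,   # use_nucleus_sampling: True for more creative captions
    top_p=0.9,
)
print(processor.decode(output_ids[0], skip_special_tokens=True))
```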
- Add the "Blip Caption (Advanced)" node to your workflow
- Connect an image input to the node
- Configure the following parameters:
- All basic node parameters
- min_length: Minimum caption length
- num_beams: Number of beams for beam search
- top_p: Top-p value for nucleus sampling
- force_refresh: Force reload model from disk
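As a rough illustration, the advanced parameters map onto the same `generate()` call. The snippet below reuses the `model`, `processor`, and `inputs` objects from the earlier sketch and is not the node's actual code; note that beam search and nucleus sampling are normally alternatives, so num_beams matters when sampling is disabled and top_p when it is enabled.

```python
# Illustrative only: advanced parameters passed to generate(), reusing the
# model/processor/inputs objects from the sketch above.
output_ids = model.generate(
    **inputs,
    min_length=10,   # min_length: shortest allowed caption
    max_length=75,   # max_length
    num_beams=4,     # num_beams: beam search width (used when do_sample=False)
    do_sample=False,
)
print(processor.decode(output_ids[0], skip_special_tokens=True))
```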
This repository's code is released under the GPL-3.0 License; see the LICENSE file for details.