Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View HuiLi's full-sized avatar

Block or report HuiLi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

CV

17 repositories

Multimodal-GPT

Python 1,510 131 Updated Jun 4, 2023

YOLOv6: a single-stage object detection framework dedicated to industrial applications.

Jupyter Notebook 5,862 1,058 Updated Aug 7, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,999 1,079 Updated Nov 18, 2024

ImageBind One Embedding Space to Bind Them All

Python 8,849 828 Updated Oct 3, 2025

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Python 1,792 179 Updated Sep 25, 2023

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,890 2,657 Updated Aug 12, 2024

[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.

Python 522 63 Updated Aug 8, 2024

Data annotation toolbox supports image, audio and video data.

Python 1,406 154 Updated Oct 1, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,617 1,072 Updated Nov 4, 2025

Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM

Python 101 9 Updated May 17, 2024

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Python 4,171 427 Updated Aug 23, 2024
Python 176 7 Updated Aug 23, 2023

Benchmarking toolkit for patch-based histopathology image classification.

Python 42 4 Updated Jun 2, 2023

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python 1,070 89 Updated Jun 13, 2024

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 623 43 Updated Dec 30, 2024

A Large-Scale In-the-wild Dataset for Plant Disease Segmentation

Python 49 6 Updated Mar 25, 2025