Stars
A hands-on, step-by-step practical guide to Hugging Face Transformers; course videos are updated in sync on Bilibili and YouTube
A paper list of object detection using deep learning.
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Incorporating VIsual LAyout Structures for Scientific Text Classification
This is a multi-modal fusion method based on VGG16 and FastText for identifying useful information collected from social media platforms.
An AI-powered tool for automatic lecture transcription, summarization, and quiz generation. It leverages advanced speech recognition, natural language processing, and video analysis to transform le…
Multimodal data feature extraction and fusion
PyTorch implementation of Multimodal Fusion Transformer for Remote Sensing Image Classification.
🎁 A Large-scale Multi-modal E-Commerce Products Dataset (LTDL@IJCAI-21 Best Dataset & Pattern Recognition 2023)