Stars
Our survey's paper list on Agentic AI, continuously updated with the latest research.
This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!
Mobile-Agent: The Powerful GUI Agent Family
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models. [ICCV 2023 Oral]
official PyTorch implement of Towards Adversarial Attack on Vision-Language Pre-training Models
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
Retinal Vessel Segmentation in Fundoscopic Images with DenseNet
A python kinect controller for game witcher3