From Pixels to Words -- Towards Native Vision-Language Primitives at Scale
Haiwen Diao
Paranioar
AI & ML interests
Vision-and-Language, Parameter-efficient Transfer Learning, Multi-modal Large Language Model
Recent Activity
updated
a collection
2 days ago
NEO1_0
upvoted
a
paper
2 days ago
Agent Learning via Early Experience
upvoted
a
paper
2 days ago
From Pixels to Words -- Towards Native Vision-Language Primitives at
Scale