Stars
一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words an…
OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]
Towards Large Multimodal Models as Visual Foundation Agents
Useful resources for creating Design Artificial Intelligence
Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021
[CVPR 2023] DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality
An awesome list of layout generation papers
Figma Files Scraper for Research & Studies
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
[NIPS 2023] Official implementation for "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxiv.org/abs/2306.14685
[CVPR 2022 Oral] Towards Layer-wise Image Vectorization
[CVPR 2022 Oral] Towards Layer-wise Image Vectorization
The dataset includes UI object type labels (e.g., BUTTON, IMAGE, CHECKBOX) that describes the semantic type of an UI object on Android app screenshots. It is used for training and evaluation of the…
CGL-Dataset v2 for huggingface datasets
SVG Differentiable Rendering: Generating vector graphics using neural networks. Support: text-to-SVG, Image-to-SVG, SVG Editing.
[ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
[CVPR 2024] Official implementation for "SVGDreamer: Text Guided SVG Generation with Diffusion Model" https://arxiv.org/abs/2312.16476
A collection of resources on controllable generation with text-to-image diffusion models.
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).