GUEST EDITORS' INTRODUCTION

Generative AI for Computer Graphics

Rajesh Sharma, ETH Zurich, 8092, Zürich, Switzerland
Vinicius C. Azevedo, Disney Research Studios, 8006, Zurich, Switzerland
Tomasz Bednarz, NVIDIA, Santa Clara, CA, 95051, USA
Doug Roble, Meta, Menlo Park, CA, 94025, USA

Generative AI has emerged as a transformative force in the realm of computer
graphics, offering innovative methods that push the boundaries of creativity,
efficiency, and realism. In this special issue, we delve into the myriad ways in which
generative AI is reshaping the field, with six articles that explore its applications
across a wide range of topics. These contributions cover advancements in neural
networks for image generation, AI-assisted design tools, deep learning techniques
for realistic simulations, and the future of AI-driven animation. By examining both
the theoretical and practical implications of these developments, this issue
provides a comprehensive overview of how generative AI is enhancing the art and
science of computer graphics. As the field continues to evolve, the articles in this
issue offer a glimpse into the exciting possibilities that lie ahead, illuminating the
potential of AI to redefine creative workflows and technical innovations in graphics.

0272-1716 © 2025 IEEE. All rights reserved, including rights for text and data mining, and
training of artificial intelligence and similar technologies.
Digital Object Identifier 10.1109/MCG.2025.3574915
Date of current version 16 June 2025.
March/April 2025, IEEE Computer Graphics and Applications, published by the IEEE Computer Society.

The rise of powerful generative artificial intelligence (Gen AI) systems has catalyzed a
revolutionary paradigm shift in visual computing. These systems generate content through
probabilistic, data-driven algorithms that learn complex visual patterns from massive
datasets, enabling them to produce rich, diverse, and highly realistic imagery. This stands
in sharp contrast to the traditional computer graphics pipeline, which has long been
grounded in deterministic processes, such as physical simulation, geometric modeling, and
rule-based rendering. At the intersection of these two worlds lies an emerging paradigm: a
fusion of statistical learning and procedural generation that redefines how images,
animations, and virtual worlds are conceived and created. This convergence opens the door
to hybrid workflows where machine learning augments artistic intent, where rules and
randomness coexist, and where creativity is no longer bound solely by manual craft or
simulation fidelity.

As we enter this new era, it is clear that Gen AI is not just enhancing the traditional
computer graphics pipeline; it is fundamentally reshaping its foundations and expanding
the very notion of what is computationally and creatively possible. In this special issue,
we address current challenges in this emergent process and briefly summarize the articles
published in this issue.

CHALLENGES IN AI FOR COMPUTER GRAPHICS

Achieving Realism and Detail in Generated Content
One of the primary challenges of integrating Gen AI in computer graphics is to match the
high standards required by established industry pipelines, whether they relate to the
development of real-time interactive applications or to the integration of AI into one of
the many stages of the film production pipeline. While recent advancements in Gen AI have
significantly improved image quality and fidelity, issues such as inconsistencies,
artifacts, and unnatural patterns persist. These issues often arise because the
data-driven process that underlies neural approaches is not constrained by the traditional
deterministic laws that underpin imagery generation in
computer graphics. Addressing these challenges requires a better understanding of how to
embed physical constraints into generative pipelines without hindering their creative
capabilities, alongside a deeper understanding of the perceptual cues that humans associate
with realism.

Controlling the Generation and Output
Gen AI's inherent ability to produce diverse and novel outputs can simultaneously introduce
difficulties in precisely controlling the creative process. Creators require intuitive
tools and mechanisms to direct generative models toward desired outcomes without
compromising the creative potential of these systems. Current generative tools are usually
based on text-to-image or text-to-video pipelines; textual controls, while effective at
creating a significant embedding of the data distribution, are not an effective interface
for artists at the different stages of the production pipeline. Therefore, to better
support user-guided control, there is a need for more interactive interfaces, conditional
generation techniques, and robust methods for specifying constraints or stylistic
attributes. Providing artists and designers with precise yet flexible control mechanisms
is crucial for the widespread adoption and effective use of Gen AI in professional
workflows.

Training Data and Generalization
The performance and reliability of Gen AI systems are heavily influenced by the quality,
diversity, and representativeness of the training data. Curating comprehensive datasets
that adequately capture the complexity of real-world visual phenomena or diverse artistic
styles is a complex endeavor. Limited or biased datasets can cause generative models to
generalize poorly, leading to outputs that are repetitive, biased, or overly specialized.
Effective solutions require systematic approaches to data collection, augmentation, and
synthetic data generation to ensure models generalize well beyond their initial training
conditions, thus expanding their applicability across various graphics-related tasks and
scenarios.

Computation Cost and Efficiency
Another critical challenge is the substantial computational resources required to train
and deploy state-of-the-art Gen AI models. High-resolution and realistic outputs typically
demand large-scale neural networks, which are computationally expensive, limiting their
accessibility and practicality for widespread usage, especially in real-time or
interactive applications. Research into more efficient model architectures, optimized
training algorithms, and hardware acceleration is essential to reduce computational costs.
Progress in these areas can democratize Gen AI technology, allowing smaller studios,
independent creators, and real-time applications to benefit fully from these AI-assisted
pipelines.

Integration Into Existing Workflows
The integration of Gen AI technologies into established computer graphics pipelines poses
additional practical and technical challenges. Many professional workflows depend on
precise control, reproducibility, and seamless interoperability between software systems
and tools. Gen AI techniques must be adapted or developed in a manner that aligns smoothly
with existing production standards, pipelines, and creative practices. Furthermore, the
direct output of many current generative models typically consists of finalized raster
images or video sequences. This is a limitation for downstream creative workflows that
require further manipulation and refinement. To address this, significant research and
development efforts are required to adapt these generative architectures to produce more
versatile outputs. Specifically, models need to be engineered to generate editable layers,
such as those found in professional image editing software (e.g., Photoshop), or distinct,
separable assets, such as foreground objects, backgrounds, and individual elements within
a scene. This capability would unlock new possibilities for artists and designers,
allowing them to leverage the creative potential of Gen AI while retaining precise control
over the final visual outcome through subsequent editing and compositing stages.

ARTICLES IN THE SPECIAL ISSUE
As the field continues to evolve, this special issue provides a snapshot of the latest
developments and perspectives on Gen AI in computer graphics. Through the formal IEEE
Computer Graphics and Applications review process of the six submissions, we accepted six
articles for this special issue. The six articles in this issue offer insights into
different facets of the subject, from theoretical advancements to practical applications.

Ye et al. [A1] presented a unified visual comparison framework that combines neural
embeddings with computational aesthetics to analyze and compare
human- and AI-generated paintings. Using CLIP embeddings and interpretable aesthetic
features (such as color, composition, and edge detail), the authors develop an interactive
visualization system to explore dataset-level differences and artist-specific style
evolution. Case studies, including a comparison between real and AI-generated Picasso
works, demonstrate that while AI art approximates stylistic elements, it struggles to
replicate the nuanced progression and diversity found in human creativity.

Kavouras et al. [A2] introduced a framework that uses Gen AI, specifically image
inpainting, to support architects and urban planners in creating environmentally friendly
urban interventions. The system generates multiple visual alternatives and incorporates a
voting-based evaluation method that includes input from both experts and citizens,
promoting inclusive and participatory planning. Through four case studies, the authors
demonstrate how this approach can accelerate early design stages while highlighting the
current limitations of AI in producing technically complete solutions.

Kunz et al. [A3] introduced a novel framework for real-time video stylization using text
prompts and diffusion models, enabling users to apply artistic effects, such as cartoon or
painterly styles, to live video streams at 30 fps on commodity GPUs. By combining
diffusion-based keyframe stylization with a lightweight, few-shot patch-based training
strategy, the system achieves both high visual quality and temporal consistency,
overcoming previous limitations in speed and coherence. Especially suited for video
conferencing, this interactive approach broadens creative expression and user
accessibility in real-time graphics applications.

Mures et al. [A4] presented a novel framework for generating synthetic datasets for
semantic segmentation using controlled diffusion models guided by segmentation maps and
text prompts. By bypassing traditional rendering pipelines, the authors demonstrate how
Gen AI can produce high-quality, structurally consistent labeled images across diverse
scenarios, such as adverse weather, or specific domains, such as car part segmentation.
Experimental results show that models trained on these AI-generated datasets consistently
outperform those trained on conventionally rendered datasets, suggesting a persuasive
shift from "Should I render?" to "AI should generate."

Casas et al. [A5] introduced HoloJig, an interactive speech-to-VR system that generates
immersive 3-D environments in real time from spoken prompts using diffusion models and
depth-based rendering. By integrating progressive refinement with seamless scene
transitions, HoloJig maintains user immersion even during compute-heavy operations. The
system opens up new possibilities for personalized virtual experiences across domains,
such as remote collaboration, education, performance, and simulation, and moves us a step
closer to realizing the potential of AI-assisted immersive environments.

Routray [A6] presented a comprehensive exploration of the ethical challenges associated
with Gen AI in computer graphics, including issues of authenticity, intellectual property,
cultural appropriation, and algorithmic bias. The article discusses how Gen AI blurs the
line between real and synthetic visuals, raising concerns about artistic integrity,
privacy, consent, and the displacement of creative professionals. To navigate these
complex implications, the author calls for a multidisciplinary approach that combines
regulatory oversight, ethical literacy, technical safeguards, and inclusive design to
ensure responsible and equitable development of Gen AI technologies.

CONCLUSION
The articles in this special issue collectively illustrate the profound impact of Gen AI
on the future of computer graphics, serving both as a powerful tool for technical
advancement and as a catalyst for creative exploration. From real-time video stylization
and AI-generated datasets to immersive speech-driven 3-D environments and ethics-centered
critiques, the selected works span a rich spectrum of innovation and inquiry. As Gen AI
continues to evolve, its successful integration into visual computing will depend not only
on algorithmic breakthroughs, but also on thoughtful design, interdisciplinary
collaboration, and ongoing reflection on its impact across society, ethics, and the arts.
We hope this issue inspires further research and responsible innovation at the
intersection of AI and graphical creativity.

ACKNOWLEDGMENTS
The guest editors would like to thank all the authors who submitted their work, as well as
the reviewers for their thoughtful and dedicated evaluations of the many high-quality
manuscripts we received. We are also grateful to Pak Chung Wong and Chris Weaver for their
leadership at IEEE Computer Graphics and Applications and for their guidance throughout
the preparation of this special issue.


APPENDIX: RELATED ARTICLES

A1. Y. Ye, R. Huang, K. Zhang, and W. Zeng, "Unified visual comparison framework for human
    and AI paintings using neural embeddings and computational aesthetics," IEEE Comput.
    Graphics Appl., vol. 45, no. 2, pp. 19–30, Mar./Apr. 2025, doi: 10.1109/MCG.2025.3555122.
A2. I. Kavouras, I. Rallis, E. Sardis, A. Doulamis, and N. Doulamis, "Voting-based
    intervention planning using AI-generated images," IEEE Comput. Graphics Appl., vol. 45,
    no. 2, pp. 31–46, Mar./Apr. 2025, doi: 10.1109/MCG.2025.3553620.
A3. D. Kunz, O. Texler, D. Mould, and D. Sykora, "Meet-in-style: Text-driven real-time video
    stylization using diffusion models," IEEE Comput. Graphics Appl., vol. 45, no. 2,
    pp. 47–56, Mar./Apr. 2025, doi: 10.1109/MCG.2025.3554312.
A4. O. A. Mures, M. Silva, M. Lijó-Sánchez, E. J. Padrón, and J. A. Iglesias-Guitian,
    "Should I render or should AI generate? Crafting synthetic semantic segmentation
    datasets with controlled generation," IEEE Comput. Graphics Appl., vol. 45, no. 2,
    pp. 57–68, Mar./Apr. 2025, doi: 10.1109/MCG.2025.3553494.
A5. L. Casas, S. Hannah, and K. Mitchell, "HoloJig: Interactive spoken prompt specified
    generative AI environments," IEEE Comput. Graphics Appl., vol. 45, no. 2, pp. 69–77,
    Mar./Apr. 2025, doi: 10.1109/MCG.2025.3553780.
A6. S. K. Routray, "Ethical considerations and implications of generative AI in computer
    graphics," IEEE Comput. Graphics Appl., vol. 45, no. 2, pp. 78–89, Mar./Apr. 2025,
    doi: 10.1109/MCG.2025.3570722.

RAJESH SHARMA is currently working toward the Ph.D. degree with ETH Zurich, 8092, Zurich,
Switzerland. His research interests include global illumination in complex scenes and
simulation solutions for vegetation, hair, cloth, snow, and water. He is the corresponding
author of this article. Contact him at [email protected].

VINICIUS C. AZEVEDO is a research scientist at Disney Research Studios, 8006, Zurich,
Switzerland. His research interests include visual computing, physically based animations,
machine learning, and generative graphics. Contact him at [email protected].

TOMASZ BEDNARZ is the director of strategic researcher engagement at NVIDIA Corporation,
Santa Clara, CA, 95051, USA. His research interests include immersive visualization,
computational modeling, and AI-driven scientific discovery. Contact him at [email protected].

DOUG ROBLE is a research scientist at Meta, Menlo Park, CA, 94025, USA. His research
interests include digital humans, machine learning, and physical simulation. Roble received
his Ph.D. degree in computer graphics from The Ohio State University, Columbus, OH, USA. He
is a member of ACM, AMPAS, and the Television Academy. Contact him at [email protected].
