Graphics Hardware

Tomas Akenine-Möller
Department of Computer Engineering
Chalmers University of Technology
Graphics hardware – why?
• About 100x faster!
• Another reason: about 100x faster!
• Simple to pipeline and parallelize
• There is currently only hardware for triangle rasterization with texturing (e.g., OpenGL acceleration)
• Ray tracing: there are research architectures, and one commercial product
– More to come!

Tomas Akenine-Möller © 2003

Today’s topics
• The basics of “perspective-correct texturing”
• Background on graphics hardware
• The architecture of the Xbox
• The architecture of the KYRO
• There is very little documentation on graphics architectures…

Perspective-correct texturing
• How are texture coordinates interpolated over a triangle?
• Linearly?
[Figure: linear interpolation vs. perspective-correct interpolation]
• Perspective-correct interpolation gives the foreshortening effect!
• Hardware does this for you, but you need to understand it anyway!

Recall the following, and then we change notation a bit
• Before projection the point is $v$, and after projection it is $p$ ($p = Mv$)
• After projection, $p_w$ is not 1!
• Homogenization: $(p_x/p_w,\ p_y/p_w,\ p_z/p_w,\ 1)$
• Rewrite and change notation to:
– $w = p_w$
– And instead use: $(p_x w,\ p_y w,\ p_z w,\ w)$
– After homogenization: $(p_x,\ p_y,\ p_z,\ 1)$
• Also, remember that visible points, $(p_x,\ p_y,\ p_z,\ 1)$, are inside a unit cube: (-1,-1,-1) to (1,1,1)

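The homogenization step above can be sketched in a few lines of Python. This is only an illustration of the notation; the function names `homogenize` and `is_visible` are made up here and do not come from the slides.

```python
def homogenize(p):
    """Divide a projected point (x, y, z, w) by its w component."""
    x, y, z, w = p
    return (x / w, y / w, z / w, 1.0)


def is_visible(p):
    """True if the homogenized point lies inside the cube (-1,-1,-1) to (1,1,1)."""
    x, y, z, _ = homogenize(p)
    return all(-1.0 <= c <= 1.0 for c in (x, y, z))


# A projected point stored as (px*w, py*w, pz*w, w) with w = 2.
p = (1.0, -0.5, 0.25, 2.0)
print(homogenize(p))   # the point after division by w
print(is_visible(p))   # whether it falls inside the visible cube
```
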
Texture coordinate interpolation
• Linear interpolation does not work
• Rational linear interpolation does:
– u(x) = (ax+b) / (cx+d)
– a, b, c, d are computed from the triangle’s vertices (x,y,z,w,u,v)
• Not really efficient
• Smarter:
– Compute (u/w, v/w, 1/w) per vertex
– These quantities can be linearly interpolated!
– Then at each pixel, compute 1/(1/w) = w
– And obtain: (w*u/w, w*v/w) = (u,v)
– The (u,v) are perspective-correctly interpolated
• Need to interpolate shading this way too
– Though the errors are not as annoying as for textures
• Since linear interpolation now is OK, compute, e.g., $\Delta(u/w)/\Delta x$, and use this to update u/w when stepping in the x-direction (similarly for other parameters)

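A minimal Python sketch of the "smarter" scheme, reduced to one texture coordinate along a single scanline between two vertices. The function names and the example values (w = 1 at the near vertex, w = 3 at the far one) are assumptions chosen for illustration, not values from the slides.

```python
def perspective_correct_u(u0, w0, u1, w1, t):
    """Interpolate u at screen-space parameter t in [0, 1] between two vertices."""
    # u/w and 1/w are linear in screen space, so plain lerp is valid for them.
    u_over_w = (1 - t) * (u0 / w0) + t * (u1 / w1)
    one_over_w = (1 - t) * (1 / w0) + t * (1 / w1)
    # Per pixel: recover w = 1/(1/w), then u = w * (u/w).
    w = 1.0 / one_over_w
    return w * u_over_w


def scanline_u(u0, w0, u1, w1, n):
    """Same result at n+1 evenly spaced pixels, stepping with constant deltas."""
    uw, iw = u0 / w0, 1 / w0
    d_uw = (u1 / w1 - uw) / n   # delta of u/w per pixel step in x
    d_iw = (1 / w1 - iw) / n    # delta of 1/w per pixel step in x
    result = []
    for _ in range(n + 1):
        result.append(uw / iw)
        uw += d_uw
        iw += d_iw
    return result


# Halfway across the screen-space edge the correct u is 0.25,
# whereas naive linear interpolation of u would give 0.5.
print(perspective_correct_u(0.0, 1.0, 1.0, 3.0, 0.5))
print(scanline_u(0.0, 1.0, 1.0, 3.0, 2))
```

The midpoint value shows the foreshortening: more of the texture is compressed towards the far vertex.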
Background: Graphics hardware architectures
• Evolution of graphics hardware has started from the end of the pipeline
– Rasterizer was put into hardware first (most performance to gain from this)
– Then the geometry stage
– Application will not be put into hardware!
• Two major ways of getting better performance:
– Pipelining
– Parallelization
– Combinations of these are often used

Briefly about pipelining
• In GeForce3: 600-800 pipeline stages!
– 57 million transistors
– Pentium IV: 20 stages, 42 million transistors
• Newer cards:
– Radeon 9700: 110 M transistors
– GeForce FX 5800: 125 M transistors, 500 MHz
• Ideally: n stages → n times throughput
– But latency increases!
– However, not a problem here
  • Chip runs at about 200 MHz (5 ns per clock)
  • 5 ns * 700 = 3.5 μs
  • We have about 20 ms per frame (50 frames per second)
• Graphics hardware is simpler to pipeline because:
– Pixels are (most often) independent of each other
– Few branches and much fixed functionality
– Don’t need high clock frequency: bandwidth to memory is the bottleneck
  • This is changing with increased programmability
– Simpler to predict memory access pattern (do prefetching!)

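The latency argument is simple arithmetic, sketched here in Python with the slide's numbers (700 stages, 200 MHz, 50 frames per second). The function name is hypothetical.

```python
def pipeline_latency_s(stages, clock_hz):
    """Time for one item to traverse the whole pipeline, one stage per clock."""
    return stages / clock_hz


latency = pipeline_latency_s(700, 200e6)   # about 3.5 microseconds
frame_time = 1 / 50                        # 20 ms per frame
print(f"latency: {latency * 1e6:.1f} us")
print(f"fraction of a frame: {latency / frame_time:.6f}")
```

The pipeline latency is a tiny fraction of the frame time, which is why the deep pipeline is not a problem here.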
Parallelism
• “Simple” idea: compute n results in parallel, then combine results
• GeForce FX 5800: 8 pixels/clock, 16 textures/clock
– With a pipeline of several hundred stages, there are many pixels being processed simultaneously
• Not always simple!
– Try to parallelize a sorting algorithm…
– But pixels are independent of each other, so simpler for graphics hardware
• Can parallelize both the geometry stage and the rasterizer

Taxonomy of hardware
• Need to sort from model space to screen space
• Gives four major architectures:
– Sort-first
– Sort-middle
– Sort-last fragment
– Sort-last image
• Will describe these briefly, and then focus on sort-middle and sort-last fragment (used in commercial hardware)

Sort-First
• Sorts primitives before the geometry stage
– Screen is divided into large regions
– A separate pipeline is responsible for each region (or many)
• G is geometry; FG and FM are parts of the rasterizer
– A fragment is all the generated information for a pixel on a triangle
– FG is Fragment Generation (finds which pixels are inside the triangle)
– FM is Fragment Merge (merges the created fragments with various buffers (Z, color))
• Not explored much at all

Sort-Middle
• Sorts between G and R (the rasterizer, i.e., FG and FM)
• Pretty natural, since after G, we know the screen-space positions of the triangles
• Most hardware uses this!
– Examples include InfiniteReality (from SGI) and the KYRO architecture (from Imagination)
• Spread work arbitrarily among G’s
• Then depending on screen-space position, sort to different R’s
– Screen can be split into “tiles”. For example:
  • Rectangular blocks (8x8 pixels)
  • Every n scanlines
• The R is responsible for rendering inside its tile
• A triangle can be sent to many FG’s depending on overlap (over tiles)
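The sorting step of a sort-middle design can be sketched as binning screen-space triangles into 8x8 tiles. The sketch below uses a conservative bounding-box test, which is an assumption for brevity (real hardware may test overlap more precisely), and the names `tiles_overlapped` and `sort_middle` are hypothetical.

```python
import math
from collections import defaultdict


def tiles_overlapped(tri, tile_w=8, tile_h=8):
    """Tiles touched by the triangle's screen-space bounding box (conservative)."""
    xs = [x for x, _ in tri]
    ys = [y for _, y in tri]
    tx0, tx1 = math.floor(min(xs) / tile_w), math.floor(max(xs) / tile_w)
    ty0, ty1 = math.floor(min(ys) / tile_h), math.floor(max(ys) / tile_h)
    return [(tx, ty) for ty in range(ty0, ty1 + 1) for tx in range(tx0, tx1 + 1)]


def sort_middle(triangles):
    """Map each tile to the list of triangle ids its rasterizer must process."""
    bins = defaultdict(list)
    for tri_id, tri in enumerate(triangles):
        for tile in tiles_overlapped(tri):
            bins[tile].append(tri_id)
    return dict(bins)


triangles = [
    [(1, 1), (6, 2), (3, 5)],      # fits inside a single tile
    [(4, 4), (20, 4), (4, 12)],    # spans several tiles
]
bins = sort_middle(triangles)
print(bins)
print(sum(len(ids) for ids in bins.values()), "triangle submissions for 2 triangles")
```

The second triangle is submitted once per overlapped tile, which is the duplicated work that sort-middle pays for.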

Sort-Last Fragment
• Sorts between FG and FM
• Xbox uses this!
• Again spread work among G’s
• The generated work is sent to FG’s
• Then sort fragments to FM’s
– An FM is responsible for a tile of pixels
• A triangle is only sent to one FG, so this avoids doing the same work twice
– Sort-middle: if a triangle overlaps several tiles, then the triangle is sent to all FG’s responsible for these tiles
– Results in extra work

Sort-Last Image
• Sorts after the entire pipeline
• So each FG & FM has a separate frame buffer for the entire screen (Z and color)
• After all primitives have been sent to the pipeline, the Z-buffers and color buffers are merged into one color buffer
• Can be seen as a set of independent pipelines
• Huge memory requirements!
• Used in research, but probably not commercially

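The final merge in a sort-last image design can be sketched as a per-pixel depth comparison across the pipelines' full-screen buffers. This is a simplified model assuming that a smaller z means closer to the viewer; the function name `composite` is hypothetical.

```python
def composite(buffers):
    """Merge full-screen (z, color) buffers from independent pipelines.

    buffers: list of (z_list, color_list) pairs of equal length.
    Assumes smaller z is closer to the viewer.
    """
    n = len(buffers[0][0])
    out_z = [float("inf")] * n
    out_color = [None] * n
    for z, color in buffers:
        for i in range(n):
            if z[i] < out_z[i]:
                out_z[i] = z[i]
                out_color[i] = color[i]
    return out_z, out_color


pipeline_a = ([0.5, 0.9], ["red", "red"])
pipeline_b = ([0.7, 0.2], ["blue", "blue"])
print(composite([pipeline_a, pipeline_b]))
```

Every pipeline needs its own Z and color buffer for the whole screen, which is where the large memory requirement comes from.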
Memory bandwidth usage is huge!!
R is read, W is write, T is texture, Z is Z-buffer, C is color buffer
• Assuming 2 textures per pixel, and TR costs 24 bytes (trilinear MIP-mapping), the rest cost 32 bits (4 bytes) each
• A “normal” pixel costs:
– ZR+ZW+CW+2*TR = 60 bytes per pixel
• At 60 fps, 1280x1024: about 4.5 GB/s
• But a pixel is overwritten many times!
• Overdraw = 4 gives: 18 GB/s!
• Then assume DDR RAM at 300 MHz, 256 bits per access: 9.6 GB/s
• 18 > 9.6 !!

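The bandwidth estimate can be reproduced with a short Python calculation using the slide's cost model. The constant and function names are made up for this sketch; the exact product comes out at about 4.7e9 bytes/s, close to the rounded 4.5 GB/s quoted above.

```python
Z_READ = Z_WRITE = COLOR_WRITE = 4   # bytes each (32 bits)
TEX_READ = 24                        # bytes per trilinear texture read
BYTES_PER_PIXEL = Z_READ + Z_WRITE + COLOR_WRITE + 2 * TEX_READ   # 60


def required_bandwidth(width, height, fps, overdraw=1):
    """Bytes per second needed under the per-pixel cost model above."""
    return width * height * fps * overdraw * BYTES_PER_PIXEL


def available_bandwidth(clock_hz, bits_per_access):
    """Peak bytes per second, assuming one access per clock."""
    return clock_hz * bits_per_access // 8


need = required_bandwidth(1280, 1024, 60, overdraw=4)
have = available_bandwidth(300_000_000, 256)
print(f"needed:    {need / 1e9:.1f} GB/s")
print(f"available: {have / 1e9:.1f} GB/s")
```
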
Memory bandwidth, cont’d
• 18 > 9.6
• On top of that, bandwidth utilization is never 100%, and we can also use more textures and anti-aliasing, which use up even more bandwidth
• However, there are many techniques to reduce bandwidth usage:
– Texture caching with prefetching
– Texture compression
– Z-compression
– Z-occlusion testing (HyperZ)

Z-occlusion testing and Z-compression
• One way of reducing bandwidth
– ATI Inc. pioneered this with their HyperZ technology
• Very simple, and very effective
• Divide screen into tiles of 8x8 pixels
• Keep a status memory on-chip
– Very fast access
– Stores additional information that this algorithm uses
• Enables occlusion culling on a per-triangle basis, Z-compression, and fast Z-clears

Architecture of Z-cull and Z-compress
• Store zmax per tile, and a flag (whether cleared, compressed/uncompressed)
• Rasterize one tile at a time
• Test if zmin of the triangle is farther away than the tile’s zmax
– If so, don’t do any work for that tile!!!
– Saves texturing and Z-read for the entire tile – huge savings!
• Otherwise read the compressed Z-buffer, and unpack
• Write to the unpacked Z-buffer, and when finished compress and send back to memory, and also update zmax
• For fast Z-clears: just set a flag to “clear” for each tile
– Then we don’t need to read from the Z-buffer, just send cleared Z for that tile

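A rough Python model of the per-tile test described above. It assumes that larger z means farther away, leaves out compression entirely, and uses a tiny tile for the example; the `Tile` class and `draw` function are hypothetical names, not ATI's or NVIDIA's actual design.

```python
FAR = 1.0   # assumed depth of a cleared pixel (larger z = farther away)


class Tile:
    def __init__(self, pixels=64):
        self.z = [FAR] * pixels   # stands in for the Z-buffer data in memory
        self.zmax = FAR           # kept in on-chip status memory
        self.cleared = True       # fast clear: only this flag is set


def draw(tile, tri_zmin, fragments):
    """Process a triangle's fragments (pixel_index, z) for one tile.

    Returns False if the whole tile was culled without touching the Z-buffer.
    """
    if tri_zmin > tile.zmax:
        return False              # triangle is behind everything in the tile
    if tile.cleared:
        # No read needed: a cleared tile is known to contain FAR everywhere.
        tile.z = [FAR] * len(tile.z)
        tile.cleared = False
    for i, z in fragments:
        if z < tile.z[i]:
            tile.z[i] = z
    tile.zmax = max(tile.z)       # update the status memory
    return True


tile = Tile(pixels=4)
print(draw(tile, 0.2, [(0, 0.2), (1, 0.25), (2, 0.3), (3, 0.35)]))   # processed
print(draw(tile, 0.5, [(0, 0.5)]))   # culled: 0.5 is behind zmax = 0.35
```
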
The Xbox game console
• Built by Microsoft and NVIDIA
• Is almost a PC:
– Pentium III, 733 MHz
– An extended GeForce3
• Why a console then?
– It stays constant…
– You don’t have to care about 20 different graphics cards, and CPUs from 100 MHz to 2 GHz

Xbox is a UMA machine
• UMA = unified memory architecture
– Every component in the system accesses the same memory
• We focus on the GPU

Xbox Graphics Processing Unit (GPU)
• Supports programmable vertex shaders
– No fixed-function geometry stage
• Is a sort-last fragment architecture
• Rasterizer: handles four pixels per clock
• Runs at 250 MHz

Xbox geometry stage
• Dual vertex shaders
– Same vertex program is executed on two vertices in parallel
• Vertex shader unit is a SIMD machine that operates on 4 components at a time
– The point is that instead of a fixed-function geometry stage, we now have full control over animation of vertices, lighting, etc.
• Uses DMA (direct memory access), so that the GPU fetches vertices directly from memory by itself!
• Three different caches – for better performance!

Xbox geometry stage: caches
• Pre T&L (transform & lighting) cache:
– Stores vertices fetched from memory
– The idea is to avoid redundant memory fetches
– A vertex is, on average, shared by 6 triangles
– Has 4 kbytes of storage
• Post T&L cache:
– Avoids running the vertex shader more than once for the same vertex
– Has storage for 16 transformed vertices
• Primitive Assembly (PA) cache:
– A transformed vertex requires a lot of memory, and so it takes a while to fetch a vertex from the Post T&L cache
– Can store 3 fully shaded vertices
– Is there to avoid fetches from the Post T&L cache
• Task of the PA cache is to feed the rasterizer with triangles

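The effect of the Post T&L cache can be illustrated by counting vertex shader runs for an indexed triangle list. The slides give the capacity (16 vertices) but not the replacement policy, so the FIFO behaviour below is an assumption, and `shader_runs` is a hypothetical name.

```python
from collections import deque


def shader_runs(indices, cache_size=16):
    """Count vertex shader executions with a FIFO cache of transformed vertices."""
    cache = deque(maxlen=cache_size)   # oldest entry is dropped when full
    runs = 0
    for idx in indices:
        if idx not in cache:
            runs += 1                  # cache miss: the vertex must be shaded
            cache.append(idx)
    return runs


# Three triangles sharing vertices: 9 indices, but only 5 distinct vertices.
indices = [0, 1, 2, 2, 1, 3, 2, 3, 4]
print(shader_runs(indices))                 # with the 16-entry cache
print(shader_runs(indices, cache_size=1))   # with almost no cache
```

With good index ordering, each shared vertex is shaded once instead of once per triangle that uses it.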
Xbox rasterizer
• First block: triangle setup (TS) and FG
• TS computes various deltas (see slide 6) and other startup info
• This block also does Z-occlusion testing
• FG generates fragments inside triangles
– Tests 2x2 pixels at a time, and forwards these to the four pipelines that follow
– Note: near edges, not all pixels are inside the triangle, and therefore 0-3 pipelines may be idle
– There are many strategies for finding which fragments are inside a triangle, but exactly how this is done on the Xbox is not known
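One common strategy for the inside test is edge functions evaluated at pixel centers. The sketch below applies it to a 2x2 quad; it is not the Xbox's method (which, as noted, is not known), and the function names and example triangle are assumptions.

```python
def edge(a, b, p):
    """Signed area test: which side of the edge a->b the point p lies on."""
    return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])


def inside(tri, p):
    """True if p is inside the triangle, for either vertex winding order."""
    a, b, c = tri
    e0, e1, e2 = edge(a, b, p), edge(b, c, p), edge(c, a, p)
    return (e0 >= 0 and e1 >= 0 and e2 >= 0) or (e0 <= 0 and e1 <= 0 and e2 <= 0)


def quad_coverage(tri, x, y):
    """Coverage of the 2x2 pixel quad whose top-left pixel is (x, y)."""
    return [inside(tri, (x + dx + 0.5, y + dy + 0.5))
            for dy in (0, 1) for dx in (0, 1)]


tri = [(0.0, 0.0), (4.5, 0.0), (0.0, 4.5)]
mask = quad_coverage(tri, 2, 1)
print(mask)
print(mask.count(False), "of the four pixel pipelines would be idle for this quad")
```
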

Xbox rasterizer, cont’d
• Sorting is done after FG
– Sort-last fragment architecture
• First: 2 texture units (TX)
– Can be run twice → 4 texture lookups
• RC (register combiners) operate on the filtered texel values from TX and on the shading interpolated over the triangle (programmable too)
– Can be used for bump mapping, for example
• Finally, the results from TXs, RCs, shading interpolation, and fog interpolation are merged into a final color for that pixel

Xbox rasterizer: fragment merge
• The combiner produced a final color for the pixel on a triangle
• FM merges this with:
– Color in color buffer (alpha blending)
– Z-buffer (depth testing)
– Stencil testing
– Alpha testing
• Z-compression and decompression are handled here as well
• Writes final color over the system memory bus
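A simplified Python sketch of a fragment merge step: alpha test, depth test, then alpha blending into the color buffer. Stencil testing and Z-compression are left out, the test order and the "smaller z is closer" convention are assumptions, and `merge_fragment` is a hypothetical name.

```python
def merge_fragment(rgba, z, i, color_buf, z_buf, alpha_ref=0.0):
    """Merge one fragment into pixel i. Returns True if the pixel was updated."""
    r, g, b, a = rgba
    if a <= alpha_ref:
        return False                  # alpha test failed
    if z >= z_buf[i]:
        return False                  # depth test failed (smaller z is closer)
    dr, dg, db = color_buf[i]
    # Standard "source over destination" alpha blending.
    color_buf[i] = (a * r + (1 - a) * dr,
                    a * g + (1 - a) * dg,
                    a * b + (1 - a) * db)
    z_buf[i] = z
    return True


color_buf = [(0.0, 0.0, 1.0)]   # one blue pixel
z_buf = [1.0]
print(merge_fragment((1.0, 0.0, 0.0, 0.5), 0.5, 0, color_buf, z_buf))
print(color_buf[0], z_buf[0])
```
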

Xbox texture swizzling
• A technique to improve usage of locality in textures
– Not likely that we will access texels in a linear fashion (i.e., one scanline at a time)
– Use swizzling instead
• Assume $(u,v) = (u_{n-1} \ldots u_1 u_0,\ v_{n-1} \ldots v_1 v_0)$
– $u_i$ and $v_i$ are bit $i$ of $u$ and $v$
• Linear (normal) address: (width*v + u) * bytes_per_color
• Swizzled address instead: $(u_{n-1} v_{n-1} \ldots u_1 v_1 u_0 v_0)$ * bytes_per_color
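The bit interleaving can be written out in Python as follows. The bit order follows the formula above (in each pair, the u bit is the more significant one); the function names and the `bits` parameter are assumptions made for the sketch.

```python
def linear_offset(u, v, width, bytes_per_color=4):
    """Byte offset of texel (u, v) in a scanline-ordered texture."""
    return (width * v + u) * bytes_per_color


def swizzled_offset(u, v, bits, bytes_per_color=4):
    """Byte offset of texel (u, v) with the bits of u and v interleaved."""
    index = 0
    for i in range(bits):
        index |= ((v >> i) & 1) << (2 * i)        # v_i goes to the even bit
        index |= ((u >> i) & 1) << (2 * i + 1)    # u_i goes to the odd bit
    return index * bytes_per_color


# In a 4x4 texture (2 bits per coordinate), each 2x2 block of texels
# occupies one contiguous 16-byte run when swizzled.
for v in range(2):
    for u in range(2):
        print((u, v), linear_offset(u, v, 4), swizzled_offset(u, v, 2))
```

The four texels needed for one bilinear lookup are adjacent in memory in the swizzled layout, while in the linear layout they sit on two different scanlines.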

Xbox texture swizzling, cont’d
• This access technique gives the following pattern (4 bytes/color)
[Figure: swizzled texel address pattern, captioned “Almost a Hilbert curve”]
• This is a space-filling curve, and those are often designed so that coherency usage is improved
• Example: bilinear filtering

Xbox conclusion
• (Almost) a PC with great graphics hardware
• Sort-last fragment architecture
• 2 vertex shaders
• 4 pixel pipelines @ 250 MHz
• Programmable per pixel as well
• One of the best consoles right now…
– Not for long though

KYRO – a different architecture
• Based on the cost-effective PowerVR architecture
• Tile-based
– For KYRO II: 32x16 pixels
• Fundamental difference
– For entire scene, do this:
– Find all triangles inside each tile
– Render all triangles inside the tile
• Advantage: can implement temporary color, stencil, and Z-buffer in fast on-chip memory
• Saves memory and memory bandwidth!
– Claims to save 2/3 of the bandwidth compared to a traditional architecture (without Z-occlusion testing)

KYRO architecture overview

• CPU sends triangle data to KYRO II
• Tile Accelerator (TA)
– Needs an entire scene before the ISP and TSP blocks can start
– So the TA works on the next image, while the ISP and TSP work on the current image (i.e., they work in a pipelined fashion)
– TA sorts triangles, and creates a list of triangle pointers for each tile (for triangles inside the tile)

KYRO
• Tile accelerator:
– When all triangles for the entire scene are sorted into tiles, the TA can send tile data to the next block, the ISP
– And the TA then continues on the next frame’s sorting in parallel
• Image synthesis processor (ISP):
– Implements Z-buffer, color buffer, stencil buffer for the tile
– And occlusion culling (similar to Z-occlusion testing)
  • Tests 32 pixels at a time against the Z-buffer
  • Records which pixels are visible
– Groups pixels with the same texture and sends them to the TSP
  • These are guaranteed to be visible, so we only texture each pixel once

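The idea of resolving visibility for a whole tile before any texturing can be sketched as two passes. This is a conceptual model only, not the actual ISP/TSP design; `render_tile` and the `shade` callback are hypothetical, and smaller z is assumed to be closer.

```python
import math


def render_tile(fragments, num_pixels, shade):
    """Deferred texturing for one tile.

    fragments: iterable of (pixel_index, z, triangle_id) for all triangles
    in the tile. shade(triangle_id, pixel_index) is called at most once
    per pixel, and only for the visible triangle.
    """
    depth = [math.inf] * num_pixels
    owner = [None] * num_pixels
    # Pass 1 (ISP-like): find the visible triangle per pixel, no texturing.
    for pixel, z, tri_id in fragments:
        if z < depth[pixel]:
            depth[pixel] = z
            owner[pixel] = tri_id
    # Pass 2 (TSP-like): texture and shade only the visible fragments.
    return [None if t is None else shade(t, pixel)
            for pixel, t in enumerate(owner)]


calls = []

def shade(tri_id, pixel):
    calls.append((tri_id, pixel))
    return tri_id   # stands in for a textured color


frags = [(0, 0.5, "a"), (0, 0.2, "b"), (1, 0.7, "a")]
print(render_tile(frags, 3, shade))
print(len(calls), "shading calls for", len(frags), "fragments")
```
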
KYRO: TSP
• Texture and Shading Processor (TSP):
– Handles texturing and shading interpolation
• Has two pipelines that run in parallel
– 2 pixels per clock
• Can use 8 textures at most
– Is implemented by “looping” in the TSP
– I.e., not full speed
• Texture data is fetched from local memory
• Supersampling: 2x1, 1x2, and 2x2
– Renders a larger image, then filters and scales it down
– For 2x2: need only 4x the size of a tile (or rather, render 4x as many tiles, i.e., do not need 4x the memory)

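The "filter and scale down" step for 2x2 supersampling can be sketched with a simple box filter. The slides do not say which filter the KYRO uses, so the averaging below is an assumption, and `downsample_2x2` is a hypothetical name operating on a single-channel image.

```python
def downsample_2x2(img):
    """Average each 2x2 block of a supersampled image (list of rows)."""
    h, w = len(img), len(img[0])
    return [[(img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4
             for x in range(0, w, 2)]
            for y in range(0, h, 2)]


supersampled = [
    [0, 4, 8, 8],
    [8, 4, 8, 8],
]
print(downsample_2x2(supersampled))   # one output row of two pixels
```
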
KYRO: pros and cons
• Uses a small amount of very fast memory
– Reduces bandwidth greatly
– Reduces frame buffer memory greatly
• But more local memory is needed
– For tile sorting
– Amount of local memory places a limit on how many triangles can be rendered
– 3 MB can handle a little over 30,000 triangles
• Design is parallel
– Add more pipelines that can handle the rest of the architecture that follows the Tile Accelerator
– But the bottleneck will (likely) move, so it is not certain how much can be gained

Challenges for the future
• Continue to push the frontier of “normal” graphics hardware
– How long can the “2x performance per 6 months” keep up?
– Keep adding new features…
– Next generation is expected to be massively programmable, both at vertices and at pixels
– Another goal is to make rendering more realistic
  • Do this by developing new algorithms for the programmable hardware

Challenges for the future
• Design a new architecture targeted for global illumination
• Very few have focused on “ray tracing”-based algorithms so far
• It is time now…
• Would be nice with:
– Rapid intersection testing of curved surfaces in hardware
– Rapid traversal of spatial data structures
– Handling of very large scenes
  • Standard graphics hardware handles these quite well because a triangle can be discarded once it has been rendered
  • Ray tracing-based algorithms cannot do this, because they render shadows and reflections and therefore need to know about nearby geometry
– Photon mapping…

Challenges for the future
• Design really small architectures with really scarce resources
– Little chip area
– Little memory
– Little bandwidth
• So that it can be used in mobile devices, e.g., PalmPilots, phones, etc.

Graphics hardware conclusion
• Possible to build great hardware for standard triangle rendering
– Reasons: pixel independence, parallelism, pipelining, etc.
• Ray tracing-based hardware will come
– It has been shown that commodity graphics hardware can be used for ray tracing
– See paper by Tim Purcell et al., SIGGRAPH 2002
• Not sure what will happen in the future, but it will happen pretty fast
– “It will be utterly fantastic”
