0% found this document useful (0 votes)

5 views35 pages

Module 6

Module 6 covers code generation in compilers, detailing the code generator's role in producing a target program from an intermediate representation. It discusses key tasks such as instruction selection, register allocation, and memory management, as well as the structure of activation records in runtime organization. The document also highlights the importance of next-use information for optimizing register allocation and the different types of runtime environments.

Uploaded by

dasarideekshitha2021a

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views35 pages

Module 6

Uploaded by

dasarideekshitha2021a

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 35

Module 6

Code Generation
• The final phase of a compiler is code generator

• It receives an intermediate representation (IR) with supplementary

information in symbol table

• Produces a semantically equivalent target program

• Code generator main tasks:

• Instruction selection

• Register allocation and assignment

• Instruction ordering
Register and Address Descriptors

• A register descriptor is used to keep track of which variable is stored

in a register.

• The register descriptors show that initially all the registers are empty.

• An address descriptor is used to keep track of location where the

variable is stored. Location may be register, memory address or stack.
Code-generation algorithm
• The algorithm takes a sequence of three-address statements as input. For each three address statement of the form

a:= b op c perform the various actions. These are as follows:

1.Invoke a function getreg to find out the location L where the result of computation b op c should be stored.

2.Consult the address description for y to determine y'. If the value of y currently in memory and register both

then prefer the register y' . If the value of y is not already in L then generate the instruction MOV y' , L to

place a copy of y in L.

3.Generate the instruction OP z' , L where z' is used to show the current location of z. if z is in both then prefer

a register to a memory location. Update the address descriptor of x to indicate that x is in location L. If x is in

L then update its descriptor and remove x from all other descriptor.

4.If the current value of y or z have no next uses or not live on exit from the block or in register then alter the

register descriptor to indicate that after execution of x : = y op z those register will no longer contain y or z.
Generating Code for Assignment
Statements
• The assignment statement d:= (a-b) + (a-c) + (a-c) can be translated
into the following sequence of three address code

t:= a-b

u:= a-c

v:= t +u

d:= v+u
Statement Code Generated Register descriptor Address descriptor
Register empty
t:= a - b MOV a, R0 R0 contains t t in R0
SUB b, R0
u:= a - c MOV a, R1 R0 contains t t in R0
SUB c, R1 R1 contains u u in R1
v:= t + u ADD R1, R0 R0 contains v u in R1
R1 contains u v in R1
d:= v + u ADD R1, R0 R0 contains d d in R0
MOV R0, d d in R0 and memory
Generate code for the following three- 1. LD R1, #1
address statements assuming all variables ST x, R1

are stored in memory locations. 2. LD R1, a

1. x = 1 ST x, R1

3. LD R1, a
2. x = a
ADD R1, R1, #1
3. x = a + 1
ST x, R1
4. x = a + b
4. LD R1, a

LD R2, b

ADD R1, R1, R2

ST x, R1
Generating Code for Assignment
Statements
• The assignment d = (a-b)
+ (a-c) + (a-c) might be
translated into the
following three-address
code sequence:

• Code sequence for the

example is:
• The two statements
LD R1, b
x=b*c LD R2, c
y=a+x MUL R1, R1, R2
LD R3, a
ADD R3, R3, R1
ST y, R3
• The three-statement sequence
x = a[i] Answer
y = b[i] LD R1, i
z=x*y MUL R1, R1, #4
LD R2, a(R1)
LD R1, b(R1)
MUL R1, R2, R1
ST z, R1
Issues in the Design of Code Generation
• Input to the code generator
• Target program
• Memory management
• Instruction selection
• Register allocation
• Evaluation order
Input to the code generator
• The input to the code generator contains the intermediate representation of the source program and

the information of the symbol table. The source program is produced by the front end.

• Intermediate representation has the several choices:

a) Postfix notation

b) Syntax tree

c) Three address code

• We assume front end produces low-level intermediate representation i.e. values of names in it can

directly manipulated by the machine instructions.

• The code generation phase needs complete error-free intermediate code as an input requires.
Target Program

• The target program is the output of the code generator. The output can be:

a) Assembly language: It allows subprogram to be separately

compiled.

b) Relocatable machine language: It makes the process of code

generation easier.

c) Absolute machine language: It can be placed in a fixed location in

memory and can be executed immediately.

Memory Management

• During code generation process the symbol table entries have to be mapped to
actual addresses

• Mapping name in the source program to address of data is co-operating done

by the front end and code generator.

• Local variables are stack allocation in the activation record while global
variables are in static area.
Instruction Selection

• Nature of instruction set of the target machine should be complete and

uniform.

• When you consider the efficiency of target machine then the instruction
speed and machine idioms are important factors.

• The quality of the generated code can be determined by its speed and size.
Register Allocation

• Register can be accessed faster than memory. The instructions involving

operands in register are shorter and faster than those involving in
memory operand.

• The following sub problems arise when we use registers:

1. Register allocation: In register allocation, we select the set of

variables that will reside in register.

2.Register assignment: In Register assignment, we pick the register

that contains variable.
Evaluation order

• The efficiency of the target code can be affected by the order in which
the computations are performed.

• Some computation orders need fewer registers to hold results of

intermediate than others.
Target Machine
• The target computer is a type of byte-addressable machine. It has 4 bytes to a word.

• The target machine has n general purpose registers, R0, R1,...., Rn-1. It also has two-address

instructions of the form: op source, destination

Where, op is used as an op-code and source and destination are used as a data field.

• It has the following op-codes:

ADD (add source to destination)

SUB (subtract source from destination)

MOV (move source to destination)

• The source and destination of an instruction can be specified by the combination of registers and

memory location with address modes.

MODE FORM ADDRESS EXAMPLE

absolute M M Add R0, R1

register R R Add temp, R1
indexed c(R) C+ contents(R) ADD 100 (R2),
R1
indirect register *R contents(R) ADD * 100
indirect indexed *c(R) contents(c+ (R2), R1
contents(R))
literal #c c ADD #3, R1
Next-Use Information
• In compiler design, the next use information is a type of data flow analysis
that can be used to optimize the allocation of registers in a computer’s
central processing unit (CPU).

• The goal of next use analysis is to determine which variables in a program

are needed in the immediate future and should therefore be stored in a
register for faster access, rather than in main memory.

• Example x = y + z;
a = x + b;
c = x + d;
• To perform the next-use analysis, the compiler examines each instruction

in the program and determines the next time that each variable is used. If a

variable is not used again until much later in the program, it may not be

worth keeping in a register and could be stored in the main memory

instead. On the other hand, if a variable is used multiple times in quick

succession, it may be more efficient to keep it in a register and avoid the

overhead of repeatedly loading and storing it in the main memory.

• Next use analysis can be combined with other optimization techniques,

such as register allocation and live range analysis, to further improve the

performance of a compiled program.

• Register allocation is only within a basic block. It follows top-down

approach.

• Assign registers to the most heavily used variables

• Traverse the block

• Use count as a priority function

• Assign registers to higher priority variables first

Need of global register allocation

• Local allocation does not take into account that some instructions (e.g. those in loops) execute

more frequently. It forces us to store/load at basic block endpoints since each block has no

knowledge of the context of others.

• To find out the live range(s) of each variable and the area(s) where the variable is used/defined

global allocation is needed. Cost of spilling will depend on frequencies and locations of uses.

• Register allocation depends on:

• Size of live range

• Number of uses/definitions

• Frequency of execution

• Number of loads/stores needed.

• Global register allocation can be seen as a graph coloring problem.

• Basic idea:

1. Identify the live range of each variable

2. Build a register interference graph (RIG) that represents conflicts

between live ranges (two nodes are connected if the variables they
represent are live at the same moment)

3. Try to assign as many colors to the nodes of the graph as there are
registers so that two neighbors have different colors
Run time Organization

• The run-time environment is the structure of the target

computers registers and memory that serves to manage
memory and maintain information needed to guide a
programs execution process.
1. Fully Static
• Fully static runtime environment may be useful for the languages in which
pointers or dynamic allocation is not possible in addition to no support for
recursive function calls.

• Every procedure will have only one activation record which is allocated
before execution.

• Variables are accessed directly via fixed address.

2. Stack Based
• In this, activation records are allocated (push of the activation record)
whenever a function call is made.

• The necessary memory is taken from the stack portion of the program.

• When program execution return from the function, the memory used
by the activation record is deallocated (pop of the activation record).
Thus, the stack grows and shrinks with the chain of function calls.
3. Fully Dynamic
• Functional language use this style of call stack management.

• The activation record is deallocated only when all references to them

have disappeared, and this requires the activation records to
dynamically freed at arbitrary times during execution.

• Memory manager (garbage collector) is needed.

• The data structure that handles such management is heap an this is

also called as Heap Management.
Activation Records
• Information needed by a single execution of a procedure is managed
using a contiguous block of storage called “activation record”.

• An activation record is allocated when a procedure is entered and it is

deallocated when that procedure is exit.

• It contain temporary data, local data, machine status, optional access

link, optional control link, actual parameters and returned values.
contents of activation records
• Return Value: It is used by calling procedure to return a value to calling

procedure.

• Actual Parameter: It is used by calling procedures to supply parameters to

the called procedures.

• Control Link: It points to activation record of the caller.

• Access Link: It is used to refer to non-local data held in other activation

records.

• Saved Machine Status: It holds the information about status of machine

before the procedure is called.

• Local Data: It holds the data that is local to the execution of the procedure.

• Temporaries: It stores the value that arises in the evaluation of an expression.

Module 4
No ratings yet
Module 4
80 pages
CD Module 3&4
No ratings yet
CD Module 3&4
74 pages
Compiler Code Generation Basics
No ratings yet
Compiler Code Generation Basics
6 pages
Module 6 - Code Generation
No ratings yet
Module 6 - Code Generation
36 pages
Unit 6
No ratings yet
Unit 6
80 pages
Compiler Design - Unit 5 NOTES
No ratings yet
Compiler Design - Unit 5 NOTES
28 pages
Codegeneration Final
No ratings yet
Codegeneration Final
31 pages
Unit 5 1 Basicblocks
No ratings yet
Unit 5 1 Basicblocks
39 pages
UNIT 4 - Chapter 1 in Compiler Design
No ratings yet
UNIT 4 - Chapter 1 in Compiler Design
51 pages
Code Generation and Optimization
No ratings yet
Code Generation and Optimization
42 pages
13-Issues in The Design of A Code Generator - 22!10!2024
No ratings yet
13-Issues in The Design of A Code Generator - 22!10!2024
54 pages
Unit 5
No ratings yet
Unit 5
13 pages
Code Opti
No ratings yet
Code Opti
26 pages
Code Generation
No ratings yet
Code Generation
21 pages
Unit 5
No ratings yet
Unit 5
8 pages
Mod 4-5
No ratings yet
Mod 4-5
40 pages
Chapter 6 Code Generation and Optimization
No ratings yet
Chapter 6 Code Generation and Optimization
34 pages
Unit - V: Study Material 1/11
No ratings yet
Unit - V: Study Material 1/11
11 pages
Code Geneartion
No ratings yet
Code Geneartion
13 pages
CD Unit 6.1
No ratings yet
CD Unit 6.1
20 pages
34-Issues in The Design of A Code Generator - Target Machine-25-10-2024
No ratings yet
34-Issues in The Design of A Code Generator - Target Machine-25-10-2024
29 pages
Compiler Design (Unit-5)
No ratings yet
Compiler Design (Unit-5)
22 pages
Code Generation
No ratings yet
Code Generation
40 pages
Acd 5
No ratings yet
Acd 5
9 pages
Unit V
No ratings yet
Unit V
42 pages
Unit 5 Part 1 - CD
No ratings yet
Unit 5 Part 1 - CD
14 pages
CD Unit-6 LM
No ratings yet
CD Unit-6 LM
17 pages
Unit-5-Code Gen
No ratings yet
Unit-5-Code Gen
13 pages
CD Unit 5
No ratings yet
CD Unit 5
26 pages
Cdunit 6
No ratings yet
Cdunit 6
20 pages
CD Unit 5
No ratings yet
CD Unit 5
9 pages
Unit-4-5
No ratings yet
Unit-4-5
36 pages
REDO - 2 CD - PDF 2
No ratings yet
REDO - 2 CD - PDF 2
2 pages
Code Generation
No ratings yet
Code Generation
49 pages
CC 7
No ratings yet
CC 7
20 pages
Code Generation
No ratings yet
Code Generation
25 pages
Unit 5
No ratings yet
Unit 5
13 pages
UNIT V CD Print
No ratings yet
UNIT V CD Print
9 pages
Code Generation in Compilation
No ratings yet
Code Generation in Compilation
9 pages
Code Generation F
No ratings yet
Code Generation F
7 pages
Compiler Code Generation Guide
No ratings yet
Compiler Code Generation Guide
31 pages
Unit 5
No ratings yet
Unit 5
10 pages
Code Generation: Issues in The Design of A Code Generator
No ratings yet
Code Generation: Issues in The Design of A Code Generator
33 pages
5.1 Issues in Code Generation
No ratings yet
5.1 Issues in Code Generation
16 pages
Code Generation I
No ratings yet
Code Generation I
32 pages
CD Unit 5
No ratings yet
CD Unit 5
26 pages
Code Generation 5th Year Computer Science Course
No ratings yet
Code Generation 5th Year Computer Science Course
20 pages
15Cs314J - Compiler Design: Unit 4
No ratings yet
15Cs314J - Compiler Design: Unit 4
71 pages
Code Generation (Autosaved)
No ratings yet
Code Generation (Autosaved)
48 pages
Security of Cloud-Based Systems
No ratings yet
Security of Cloud-Based Systems
434 pages
Code Generation and Optimization Guide
No ratings yet
Code Generation and Optimization Guide
23 pages
Compiler Design and Construction Lecture Notes
No ratings yet
Compiler Design and Construction Lecture Notes
28 pages
Lighting Technician's Guide
No ratings yet
Lighting Technician's Guide
10 pages
Chapter 8 - Code Generation Part 1
No ratings yet
Chapter 8 - Code Generation Part 1
5 pages
Unit 4 PCD
No ratings yet
Unit 4 PCD
15 pages
Code Generation
No ratings yet
Code Generation
22 pages
Chapter 8 - Code Generation
No ratings yet
Chapter 8 - Code Generation
22 pages
Compiler Design Code Generation
No ratings yet
Compiler Design Code Generation
4 pages
Code Generation for CS Students
No ratings yet
Code Generation for CS Students
15 pages
Code Generator
No ratings yet
Code Generator
44 pages
Code Generation
No ratings yet
Code Generation
43 pages
Checkpoint r80 Vs Palo Alto Networks
No ratings yet
Checkpoint r80 Vs Palo Alto Networks
4 pages
Group 1: Introduction To Computers
No ratings yet
Group 1: Introduction To Computers
29 pages
G41T M7 PDF
50% (2)
G41T M7 PDF
30 pages
M337x - 387x - 407x - Release Note - English
No ratings yet
M337x - 387x - 407x - Release Note - English
3 pages
TOPCNC TC55V Instruction Manual
No ratings yet
TOPCNC TC55V Instruction Manual
13 pages
Solis - Manual - S6 EH1P12 16K03 NV YD L - EUR - V1020240625
No ratings yet
Solis - Manual - S6 EH1P12 16K03 NV YD L - EUR - V1020240625
75 pages
Chapter 2
No ratings yet
Chapter 2
20 pages
IMF - Hacking Etico Unidad 5
100% (1)
IMF - Hacking Etico Unidad 5
63 pages
5C00641I StudentGuide
No ratings yet
5C00641I StudentGuide
257 pages
CM2 Hfa100 2001 - 02
No ratings yet
CM2 Hfa100 2001 - 02
20 pages
Chapter 7 - Software Quality Assurance
No ratings yet
Chapter 7 - Software Quality Assurance
36 pages
HP Insight Foundation
No ratings yet
HP Insight Foundation
13 pages
Digital Temp Sensor Circuit Guide
No ratings yet
Digital Temp Sensor Circuit Guide
11 pages
Technical Equipment Troubleshooting
No ratings yet
Technical Equipment Troubleshooting
3 pages
Low Power J-Fet Quad Operational Amplifiers: TL064 TL064A - TL064B
No ratings yet
Low Power J-Fet Quad Operational Amplifiers: TL064 TL064A - TL064B
11 pages
Mca Se CT I
No ratings yet
Mca Se CT I
2 pages
5 - APIM - Administration - Cassandra - Basics
No ratings yet
5 - APIM - Administration - Cassandra - Basics
25 pages
High-Performance RF Signal Processing Solutions
No ratings yet
High-Performance RF Signal Processing Solutions
55 pages
Google Cloud Network Engineer Exam Guide
No ratings yet
Google Cloud Network Engineer Exam Guide
16 pages
DPP 2
No ratings yet
DPP 2
9 pages
Docker Iti
No ratings yet
Docker Iti
23 pages
Dynamic Voltage Divider Design
No ratings yet
Dynamic Voltage Divider Design
4 pages
Single-Carrier Phase-Disposition PWM Implementation For Multilevel Flying Capacitor Converters
No ratings yet
Single-Carrier Phase-Disposition PWM Implementation For Multilevel Flying Capacitor Converters
5 pages
Why Does The Default Gateway Route Entry Disappear After Restarting The Network Service in RHEL
No ratings yet
Why Does The Default Gateway Route Entry Disappear After Restarting The Network Service in RHEL
32 pages
RFC 1123
No ratings yet
RFC 1123
94 pages
Computer Science Exam Prep Guide
No ratings yet
Computer Science Exam Prep Guide
92 pages
1.4. Analysis of Algorithms
No ratings yet
1.4. Analysis of Algorithms
75 pages
Guardian Digital EnGarde Presentation
No ratings yet
Guardian Digital EnGarde Presentation
16 pages

Module 6

Uploaded by

Module 6

Uploaded by

Module 6

• It receives an intermediate representation (IR) with supplementary

information in symbol table

• Produces a semantically equivalent target program

• Code generator main tasks:

• Register allocation and assignment

• A register descriptor is used to keep track of which variable is stored

• An address descriptor is used to keep track of location where the

a:= b op c perform the various actions. These are as follows:

are stored in memory locations. 2. LD R1, a

ADD R1, R1, R2

• Code sequence for the

• Intermediate representation has the several choices:

c) Three address code

directly manipulated by the machine instructions.

a) Assembly language: It allows subprogram to be separately

b) Relocatable machine language: It makes the process of code

c) Absolute machine language: It can be placed in a fixed location in

memory and can be executed immediately.

• Mapping name in the source program to address of data is co-operating done

• Nature of instruction set of the target machine should be complete and

• Register can be accessed faster than memory. The instructions involving

• The following sub problems arise when we use registers:

1. Register allocation: In register allocation, we select the set of

2.Register assignment: In Register assignment, we pick the register

• Some computation orders need fewer registers to hold results of

instructions of the form: op source, destination

• It has the following op-codes:

ADD (add source to destination)

SUB (subtract source from destination)

MOV (move source to destination)

memory location with address modes.

absolute M M Add R0, R1

• The goal of next use analysis is to determine which variables in a program

worth keeping in a register and could be stored in the main memory

instead. On the other hand, if a variable is used multiple times in quick

succession, it may be more efficient to keep it in a register and avoid the

overhead of repeatedly loading and storing it in the main memory.

• Next use analysis can be combined with other optimization techniques,

performance of a compiled program.

• Register allocation is only within a basic block. It follows top-down

• Assign registers to the most heavily used variables

• Use count as a priority function

• Assign registers to higher priority variables first

knowledge of the context of others.

• Register allocation depends on:

• Size of live range

• Number of loads/stores needed.

• Global register allocation can be seen as a graph coloring problem.

1. Identify the live range of each variable

2. Build a register interference graph (RIG) that represents conflicts

• The run-time environment is the structure of the target

• Variables are accessed directly via fixed address.

• The activation record is deallocated only when all references to them

• Memory manager (garbage collector) is needed.

• The data structure that handles such management is heap an this is

• An activation record is allocated when a procedure is entered and it is

• It contain temporary data, local data, machine status, optional access

• Actual Parameter: It is used by calling procedures to supply parameters to

the called procedures.

• Control Link: It points to activation record of the caller.

• Access Link: It is used to refer to non-local data held in other activation

• Saved Machine Status: It holds the information about status of machine

before the procedure is called.

• Temporaries: It stores the value that arises in the evaluation of an expression.

You might also like