CHAPTER SIX
Intermediate Code Generation and
Optimization
Outline
Introduction
Intermediate-Code Generation
Machine-Independent Optimizations
6.1 Introduction: Structure of a Compiler
6.2 Intermediate Code Generation
Although a compiler can directly produce a target language
(i.e., machine code or assembly for the target machine),
producing a machine-independent intermediate representation
has the following benefits.
Retargeting to another machine is facilitated.
The intermediate representation is neutral with respect to the
target machine, so the same intermediate-code generator can be
shared for all target machines.
A compiler for a new machine can be built by attaching a new
code generator to an existing front end.
Machine-independent code optimization can be applied to the
intermediate code.
Compiling Process without
Intermediate Representation
[Diagram: each source language (C, Pascal, FORTRAN, C++) needs its own
compiler for each target (SPARC, HP PA, x86, IBM PPC): m x n compilers.]
Compiling Process with Intermediate
Representation
[Diagram: each source language (C, Pascal, FORTRAN, C++) has one front
end producing a common IR, and each target (SPARC, HP PA, x86, IBM PPC)
has one back end: m + n components.]
Methods of Intermediate Code (IC) Generation
The intermediate language can be many different languages;
the designer of the compiler decides this intermediate
language. Common IRs:
Graphical representation: syntax trees, ASTs
(Abstract Syntax Trees), DAGs
Postfix notation: the abstract syntax tree is linearized as a
sequence of data references and operations.
For instance, the tree for a * (9 + d) can be mapped to the
equivalent postfix notation: a9d+*
Three-address code: every operation is represented as a 4-
part list, a quadruple:
(op, arg1, arg2, result). E.g., x := y + z -> (+, y, z, x)
Directed Acyclic Graph (DAG) Representation
Example: F = ((A+B*C) * (A*B*C)) + C
[Figure: the syntax tree and the DAG for this expression, side by side.
In the syntax tree the subexpression B*C appears twice; in the DAG the
B*C node, and the leaves A, B, and C, appear only once and are shared.]
A syntax tree depicts the natural hierarchical structure of a
source program. A DAG gives the same information but in a
more compact way, because common subexpressions are identified.
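How a DAG identifies common subexpressions can be sketched with a small amount of code: memoize nodes by their (operator, operands) signature, so asking for B*C a second time returns the node that already exists. This is an illustrative sketch (the class and names are my own, not from the slides).

```python
# Build a DAG with node sharing: identical (op, left, right) signatures
# map to the same node, so common subexpressions are created only once.

class DAG:
    def __init__(self):
        self.nodes = []   # each entry is an (op, left, right) signature
        self.memo = {}    # signature -> index of the node in self.nodes

    def node(self, op, left=None, right=None):
        key = (op, left, right)
        if key not in self.memo:        # create the node only once
            self.memo[key] = len(self.nodes)
            self.nodes.append(key)
        return self.memo[key]

# F = ((A+B*C) * (A*B*C)) + C
d = DAG()
A, B, C = d.node("A"), d.node("B"), d.node("C")
bc1 = d.node("*", B, C)               # B*C from the left operand
bc2 = d.node("*", B, C)               # B*C again: the same node comes back
F = d.node("+", d.node("*", d.node("+", A, bc1), d.node("*", A, bc2)), C)
print(bc1 == bc2, len(d.nodes))       # True 8
```

The expression needs only 8 DAG nodes, while its syntax tree has 10: the duplicated B*C subtree collapses into one shared node.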
Postfix Notation: PN
A mathematical notation wherein every operator follows all
of its operands.
Equivalently, a listing of the nodes of a tree in which each node
appears immediately after its children.
Example: the PN of the expression a * (b + c) is abc+*
How about (a + b) / (c - d)? (Answer: ab+cd-/)
Form Rules:
If E is a variable/constant, the PN of E is E itself.
If E is an expression of the form E1 op E2, the PN of E is
E1' E2' op, where E1' and E2' are the PN of E1 and E2,
respectively.
If E is a parenthesized expression of the form (E1), the PN
of E is the same as the PN of E1.
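The three form rules translate directly into a short recursive function. In this sketch (my own illustration), expressions are modeled as nested tuples (op, E1, E2); parentheses vanish once the tree is built, which is exactly rule 3.

```python
# Postfix notation by structural recursion over an expression tree.

def postfix(e):
    if isinstance(e, tuple):                  # rule 2: E1' E2' op
        op, e1, e2 = e
        return postfix(e1) + postfix(e2) + op
    return e                                  # rule 1: a name is its own PN

print(postfix(("*", "a", ("+", "b", "c"))))              # abc+*
print(postfix(("/", ("+", "a", "b"), ("-", "c", "d"))))  # ab+cd-/
```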
Three Address Code
The general form: x := y op z
x, y, and z are names, constants, or compiler-generated temporaries.
op stands for any operator, such as +, -, ...
We use the term "three-address code" because each statement
usually contains three addresses (two for the operands, one for the
result).
A popular form of intermediate code used in optimizing
compilers is three-address statements.
It is a linearized representation of the syntax tree with explicit
names given to interior nodes.
There is only one operator on the right-hand side. Thus a source-
language expression like a + b * c is translated into a sequence with
temporaries t1 and t2:
t1 = b * c
t2 = a + t1
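The way temporaries t1, t2, ... are introduced can be sketched as a bottom-up walk over the expression tree that emits one three-address statement per interior node. The representation and names here are illustrative, not from any particular compiler.

```python
from itertools import count

def gen_tac(expr):
    code, temps = [], count(1)
    def walk(e):
        if not isinstance(e, tuple):          # a leaf is already an address
            return e
        op, e1, e2 = e
        a1, a2 = walk(e1), walk(e2)           # children first (bottom-up)
        t = f"t{next(temps)}"                 # fresh temporary per interior node
        code.append(f"{t} = {a1} {op} {a2}")
        return t
    walk(expr)
    return code

print(gen_tac(("+", "a", ("*", "b", "c"))))   # ['t1 = b * c', 't2 = a + t1']
```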
DAG vs. Three Address Code
Three-address code is a linearized representation of
a syntax tree (or a DAG) in which explicit names
(temporaries) correspond to the interior nodes of the
graph.
Expression: F = ((A+B*C) * (A*B*C)) + C
Code from the syntax tree:
T1 := A
T2 := C
T3 := B * T2
T4 := T1 + T3
T5 := T1 * T3
T6 := T4 * T5
T7 := T6 + T2
F := T7
Code from the DAG:
T1 := B * C
T2 := A + T1
T3 := A * T1
T4 := T2 * T3
T5 := C
T6 := T4 + T5
F := T6
Question: Which IR code sequence is better?
Implementation of Three Address Code
• Quadruples
Four fields: op, arg1, arg2, result
Array of struct {op, *arg1, *arg2, *result}
x := y op z is represented as (op, y, z, x)
arg1, arg2, and result are usually pointers to symbol-table
entries.
May need to use many temporary names.
Many assembly instructions are like quadruples, but arg1,
arg2, and result are real registers.
• Triples
Three fields: op, arg1, and arg2. The result becomes implicit:
it is referred to by the position of the triple that computes it.
arg1 and arg2 can be pointers to the symbol table.
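The two encodings can be contrasted on the same computation, t1 = b * c; a = t1 + d. The field layouts follow the slide; the concrete Python representation is my own illustration.

```python
# Quadruples: the result field names every value explicitly.
quads = [
    ("*", "b", "c", "t1"),     # (op, arg1, arg2, result)
    ("+", "t1", "d", "a"),
]

# Triples: no result field; an integer argument refers, by position,
# to the triple whose result is being used.
triples = [
    ("*", "b", "c"),           # (0)
    ("+", 0, "d"),             # (1): arg1 is the result of triple (0)
    ("=", "a", 1),             # a receives the result of triple (1)
]

print(quads[1][1] == quads[0][3])   # quads link results by name
print(triples[1][1])                # triples link results by position
```

One consequence worth noting: because triples refer to results by position, reordering instructions (as an optimizer often wants to do) invalidates the references, while quadruples tolerate reordering at the cost of explicit temporary names.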
5/31/2015 \course\cpeg621-10F\[Link] 11
Types of Three-Address Statements
Assignment statements:
x := y op z, where op is a binary operator, e.g., add a, b, c
x := op y, where op is a unary operator, e.g., not a, , c or inttoreal a, , c
Copy statements:
x := y, e.g., mov a, , c
Unconditional jumps:
goto L, e.g., jump , , L1
Conditional jumps:
if x relop y goto L, e.g., jmprelop x, y, L
param x, call p, n, and return y, relating to procedure calls.
E.g., f(x+1, y):
add x, 1, t1
param t1, ,
param y, ,
call f, 2,
Indexed assignments:
x := y[i]
x[i] := y
Address and pointer assignments:
x := &y, x := *y, and *x := y
6.3 Code Optimization:
Summary of Front End
[Diagram: the front end = Lexical Analyzer (Scanner) + Syntax Analyzer
(Parser) + Semantic Analyzer, producing an Abstract Syntax Tree
w/ Attributes; this feeds the Intermediate-code Generator, which produces
non-optimized intermediate code. Error messages are emitted along the way.]
Code Optimization
• The machine-independent code-optimization phase attempts to
improve the intermediate code so that better target code will
result.
• Usually better means faster, but other objectives may be
desired, such as shorter code, or target code that consumes less
power.
• A simple intermediate code generation algorithm followed by
code optimization is a reasonable way to generate good target
code.
How Compiler Improves Performance
• Execution time = Operation count * Machine cycles per
operation
• Minimize the number of operations
• Arithmetic operations, memory accesses
• Replace expensive operations with simpler ones
• E.g., replace a 4-cycle multiplication with a 1-cycle shift
• Minimize cache misses
• Both data and instruction accesses
• Perform work in parallel
• Instruction scheduling within a thread
• Parallel execution across multiple threads
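The "replace expensive operations with simpler ones" item is classically called strength reduction; one instance of it, multiplication by a power of two becoming a shift, can be sketched over the (op, arg1, arg2, result) quadruples introduced earlier. The helper name is my own.

```python
def reduce_strength(quad):
    """Rewrite x * 2^k as x << k; leave everything else alone."""
    op, a1, a2, res = quad
    if op == "*" and isinstance(a2, int) and a2 > 0 and a2 & (a2 - 1) == 0:
        return ("<<", a1, a2.bit_length() - 1, res)   # x * 2^k  ->  x << k
    return quad

print(reduce_strength(("*", "x", 4, "t1")))   # ('<<', 'x', 2, 't1')
print(reduce_strength(("*", "x", 5, "t1")))   # unchanged: 5 is not a power of two
```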
Code Optimization
• There is great variation in the amount of code optimization
different compilers perform.
• Those that do the most, the so-called "optimizing compilers",
spend significant time in this phase.
• There is a trade-off between compilation time and degree of
optimization.
Why use optimization:
• There are simple optimizations that significantly improve the
running time of the target program without slowing down
compilation too much.
Types of Optimization
• Peephole
• Local
• Global
• Loop
• Inter-procedural, whole-program or link-time
• Machine code
• ….
Basic Blocks
Basic blocks are maximal sequences of consecutive three-
address instructions.
The flow of control can only enter the basic block through the
first instruction in the block (no jumps into the middle of the
block).
Control leaves the block without halting or branching,
except possibly at the last instruction in the block.
The basic blocks become the nodes of a flow graph,
whose edges indicate which blocks can follow which
other blocks.
Construction of Basic Blocks
Input: A sequence of three-address instructions
Output: A list of the basic blocks for that sequence in
which each instruction is assigned to exactly one basic
block
Method: Determine the instructions in the intermediate code
that are leaders.
The rules for finding leaders are:
The first three-address instruction in the intermediate code
is a leader.
Any instruction that is the target of a conditional or
unconditional jump is a leader.
Any instruction that immediately follows a conditional or
unconditional jump is a leader.
Example: Partitioning Three-Address
Instructions into Basic Blocks
1. i = 1
2. j = 1
3. t1 = 10 * i
4. t2 = t1 + j
5. j = j + 1
6. if j <= 10 goto (3)
7. i = i + 1
8. if i <= 10 goto (2)
9. i = 1
10. t3 = i - 1
11. if i <= 10 goto (10)
First, instruction 1 is a leader by rule (1).
Jumps are at instructions 6, 8, and 11. By rule (2), the targets
of these jumps are leaders (instructions 3, 2, and 10,
respectively).
By rule (3), each instruction following a jump is a leader:
instructions 7 and 9.
Leaders are therefore instructions 1, 2, 3, 7, 9, and 10. The basic
block of each leader contains all the instructions from itself
until just before the next leader.
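The three leader rules can be sketched in code over the example program. Jump targets are written as "goto (n)" with 1-based instruction numbers, matching the listing above; the parsing and function names are illustrative.

```python
import re

def find_leaders(instrs):
    leaders = {1}                              # rule (1): first instruction
    for i, ins in enumerate(instrs, start=1):
        m = re.search(r"goto \((\d+)\)", ins)
        if m:
            leaders.add(int(m.group(1)))       # rule (2): the jump's target
            if i < len(instrs):
                leaders.add(i + 1)             # rule (3): instruction after a jump
    return sorted(leaders)

def partition(instrs):
    # each block runs from its leader to just before the next leader
    cuts = find_leaders(instrs) + [len(instrs) + 1]
    return [instrs[cuts[k] - 1 : cuts[k + 1] - 1] for k in range(len(cuts) - 1)]

prog = ["i = 1", "j = 1", "t1 = 10 * i", "t2 = t1 + j", "j = j + 1",
        "if j <= 10 goto (3)", "i = i + 1", "if i <= 10 goto (2)",
        "i = 1", "t3 = i - 1", "if i <= 10 goto (10)"]
print(find_leaders(prog))        # [1, 2, 3, 7, 9, 10]
print(len(partition(prog)))      # 6 basic blocks
```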
Flow Graphs
A flow graph is a representation of the control flow between
basic blocks. The nodes of the flow graph are the basic blocks.
There is an edge from block B to block C if and only if it is
possible for the first instruction in block C to immediately
follow the last instruction in block B. There are two ways that
such an edge could be justified:
1. There is a conditional or unconditional jump from the end
of B to the beginning of C.
2. C immediately follows B in the original order of the three-
address instructions, and B does not end in an
unconditional jump.
B is a predecessor of C, and C is a successor of B.
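The two edge rules can be sketched in code. Here blocks are (leader_number, body) pairs, as the partitioning step could produce them; edges are pairs of block indices. The representation is my own illustration.

```python
import re

def flow_edges(blocks):
    leader_to_idx = {ldr: i for i, (ldr, _) in enumerate(blocks)}
    edges = set()
    for i, (_, body) in enumerate(blocks):
        last = body[-1]
        m = re.search(r"goto \((\d+)\)", last)
        if m:                                          # rule 1: jump edge
            edges.add((i, leader_to_idx[int(m.group(1))]))
        if i + 1 < len(blocks) and not last.startswith("goto"):
            edges.add((i, i + 1))                      # rule 2: fall-through
    return sorted(edges)

blocks = [(1, ["i = 1"]),
          (2, ["j = 1"]),
          (3, ["t1 = 10 * i", "t2 = t1 + j", "j = j + 1", "if j <= 10 goto (3)"]),
          (7, ["i = i + 1", "if i <= 10 goto (2)"]),
          (9, ["i = 1"]),
          (10, ["t3 = i - 1", "if i <= 10 goto (10)"])]
print(flow_edges(blocks))
# B1->B2, B2->B3, B3->B3, B3->B4, B4->B2, B4->B5, B5->B6, B6->B6
```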
Flow Graphs: Example
Flow graph of the program in the previous example.
The block led by the first statement of the program is the
start, or entry, node.
B1: i = 1
B2: j = 1
B3: t1 = 10 * i
    t2 = t1 + j
    j = j + 1
    if j <= 10 goto (3)
B4: i = i + 1
    if i <= 10 goto (2)
B5: i = 1
B6: t3 = i - 1
    if i <= 10 goto (10)
[Figure: Entry -> B1 -> B2 -> B3; B3 -> B3 (its own target) and B3 -> B4;
B4 -> B2 and B4 -> B5; B5 -> B6; B6 -> B6 and B6 -> Exit.]
Representation of Basic Blocks
• Each basic block is represented by a record
consisting of
– a count of the number of statements
– a pointer to the leader
– a list of predecessors
– a list of successors
Peephole Optimization
• Improve the performance of the target program by
examining and transforming a short sequence of
target instructions
• Depends on the window size
• May need repeated passes over the code
Examples
• Redundant loads and stores
MOV R0, a
MOV a, R0      (the second instruction can be deleted)
• Algebraic simplification
x := x + 0     (can be deleted)
x := x * 1     (can be deleted)
• Constant folding
x := 2 + 3  =>  x := 5
y := x + 3  =>  y := 8
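The three example patterns can be sketched over a small quad IR of the form (op, src1, src2, dst), with "mov" for copies. This is only an illustration: a real peephole optimizer matches many more patterns over a sliding window.

```python
import operator

FOLD = {"+": operator.add, "-": operator.sub, "*": operator.mul}

def peephole(quads):
    out = []
    for op, a1, a2, res in quads:
        # redundant load/store: "mov R0,a" immediately after "mov a,R0"
        if op == "mov" and out and out[-1] == ("mov", res, None, a1):
            continue
        # algebraic simplification: x + 0 and x * 1
        if (op, a2) in {("+", 0), ("*", 1)}:
            if a1 == res:                         # x := x + 0 disappears
                continue
            op, a2 = "mov", None                  # otherwise it is a copy
        # constant folding: both operands known at compile time
        elif op in FOLD and isinstance(a1, int) and isinstance(a2, int):
            a1, op, a2 = FOLD[op](a1, a2), "mov", None
        out.append((op, a1, a2, res))
    return out

print(peephole([("mov", "a", None, "R0"), ("mov", "R0", None, "a"),
                ("+", "x", 0, "x"), ("+", 2, 3, "x")]))
# [('mov', 'a', None, 'R0'), ('mov', 5, None, 'x')]
```

Note that folding x := 2 + 3 into x := 5 does not by itself turn y := x + 3 into y := 8; that also needs constant propagation of x into the later statement, which is why peephole passes are often run repeatedly.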
Local Optimizations
Analysis and transformation performed within a basic block
No control flow information is considered
Examples of local optimizations:
Local common subexpression elimination
analysis: the same expression is evaluated more than once
transformation: replace with a single calculation
Local constant folding or elimination
analysis: the expression can be evaluated at compile time
transformation: replace it by the constant, compile-time value
Dead code elimination
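Local common-subexpression elimination can be sketched as simple value numbering within one block: remember each (op, arg1, arg2) combination already computed and reuse its result instead of recomputing. A real implementation must also invalidate entries when an operand is redefined; that bookkeeping is omitted in this illustrative sketch.

```python
def local_cse(block):
    available, out = {}, []
    for op, a1, a2, res in block:
        key = (op, a1, a2)
        if key in available:                                # seen before:
            out.append(("mov", available[key], None, res))  # reuse, don't recompute
        else:
            available[key] = res
            out.append((op, a1, a2, res))
    return out

block = [("*", "b", "c", "t1"), ("+", "a", "t1", "t2"),
         ("*", "b", "c", "t3"), ("+", "t2", "t3", "t4")]
print(local_cse(block))
# the second b*c becomes: ('mov', 't1', None, 't3')
```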
Global Optimizations:
Intraprocedural
Global versions of local optimizations
Global common sub-expression elimination
Global constant propagation
Dead code elimination
Loop optimizations
Reduce code to be executed in each iteration
Examples
• Unreachable code
#define debug 0
if (debug) { print debugging information }
The intermediate code is:
if 0 <> 1 goto L1
print debugging information
L1:
Constant folding evaluates the condition 0 <> 1 to true:
if 1 goto L1
print debugging information
L1:
The jump is now unconditional, so the print statement can never
be reached and is eliminated.
Examples
• Flow-of-control optimization: a jump to a jump can be
short-circuited.
Before:
goto L1
...
L1: goto L2
After:
goto L2
...
L1: goto L2
Before:
goto L1
...
L1: if a < b goto L2
After:
if a < b goto L2
...
L1: if a < b goto L2
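The first pattern above is commonly called jump threading: when label L's only effect is to jump to M, jumps to L can be retargeted straight to M. A sketch over an illustrative instruction encoding of my own (tuples with a leading kind tag):

```python
def thread_jumps(code):
    # find labels immediately followed by an unconditional goto
    hop = {}
    for i, ins in enumerate(code):
        if ins[0] == "label" and i + 1 < len(code) and code[i + 1][0] == "goto":
            hop[ins[1]] = code[i + 1][1]
    out = []
    for ins in code:
        if ins[0] in ("goto", "if-goto"):
            target, seen = ins[-1], set()
            while target in hop and target not in seen:   # follow chains,
                seen.add(target)                          # but never loop forever
                target = hop[target]
            ins = ins[:-1] + (target,)
        out.append(ins)
    return out

code = [("goto", "L1"), ("op", "..."), ("label", "L1"), ("goto", "L2"),
        ("label", "L2"), ("op", "...")]
print(thread_jumps(code)[0])    # ('goto', 'L2')
```

After threading, the statement at L1 may become unreachable; a later unreachable-code pass, like the one in the previous example, can then delete it.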