Introduction to Code Generation Mooly Sagiv html://msagiv/courses/wcc10.html Chapter 4.

Date posted: 20-Dec-2015

Introduction to Code Generation

Mooly Sagiv

html://www.cs.tau.ac.il/~msagiv/courses/wcc10.html

Chapter 4

Structure of a simple compiler/interpreter

[Figure: compiler pipeline. Lexical analysis, Syntax analysis, Context analysis, Intermediate code (AST), then either Code generation or Interpretation. Supporting components: Symbol Table, Runtime System Design. The phases are labeled PL dependent, PL+paradigm dependent, and Machine dependent.]


Outline

• Interpreters

• Code Generation

Types of Interpreters

• Recursive
  – Recursively traverse the tree
  – Uniform data representation
  – Conceptually clean
  – Excellent error detection
  – 1000x slower than compiled code
• Iterative
  – Closer to CPU
  – One flat loop
  – Explicit stack
  – Good error detection
  – 30x slower than compiled code
  – Can invoke compiler on code fragments

Input language (Overview)

• Fully parenthesized expressions
• Arguments can be a single digit

expression → digit | '(' expression operator expression ')'
operator → '+' | '*'
digit → '0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9'
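The grammar above is small enough to parse by recursive descent. The following is a minimal sketch, not the course's parser: the Expression struct mirrors the fields used by the interpreter on the next slide (type, value, oper, left, right), while the names new_node, free_expression, and parse_expression are assumptions made for illustration. Whitespace is not handled.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical AST node: 'D' for a digit, 'P' for a parenthesized expression. */
typedef struct Expression {
    char type;                 /* 'D' or 'P' */
    int value;                 /* digit value when type == 'D' */
    char oper;                 /* '+' or '*' when type == 'P' */
    struct Expression *left, *right;
} Expression;

static Expression *new_node(char type) {
    Expression *e = calloc(1, sizeof *e);
    assert(e != NULL);         /* the sketch gives up on allocation failure */
    e->type = type;
    return e;
}

void free_expression(Expression *e) {
    if (e == NULL) return;
    free_expression(e->left);
    free_expression(e->right);
    free(e);
}

/* Parses one expression starting at *s and advances *s past it.
   Returns NULL on a syntax error. */
Expression *parse_expression(const char **s) {
    if (**s >= '0' && **s <= '9') {            /* expression -> digit */
        Expression *d = new_node('D');
        d->value = *(*s)++ - '0';
        return d;
    }
    if (**s != '(') return NULL;               /* expression -> '(' ... ')' */
    (*s)++;
    Expression *p = new_node('P');
    p->left = parse_expression(s);
    if (p->left != NULL && (**s == '+' || **s == '*')) {
        p->oper = *(*s)++;
        p->right = parse_expression(s);
        if (p->right != NULL && **s == ')') {
            (*s)++;
            return p;
        }
    }
    free_expression(p);                        /* syntax error: clean up */
    return NULL;
}
```

Each grammar alternative maps to one branch of parse_expression, which is why the fully parenthesized form needs no operator precedence handling.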

#include <stdio.h>
#include "parser.h"
#include "backend.h"

/* Recursively evaluates an expression node. */
static int Interpret_expression(Expression *expr) {
    switch (expr->type) {
    case 'D':                       /* digit */
        return expr->value;
    case 'P': {                     /* parenthesized expression */
        int e_left = Interpret_expression(expr->left);
        int e_right = Interpret_expression(expr->right);
        switch (expr->oper) {
        case '+': return e_left + e_right;
        case '*': return e_left * e_right;
        }
        break;
    }
    }
    return 0;                       /* not reached for a well-formed AST */
}

void Process(AST_node *icode) {
    printf("%d\n", Interpret_expression(icode));
}

AST for (2 * ((3*4)+9))

[Figure: each node has the fields type, oper, left, right.]

P (*)
  left: D (2)
  right: P (+)
    left: P (*)
      left: D (3)
      right: D (4)
    right: D (9)

Uniform self-identifying data representation

• The types and sizes of program data values are not known when the interpreter is written
• Uniform representation of data types
  – Type
  – Size
• The value is a pointer

Example: Complex Number

[Figure: a complex-number value with the fields re: 3.0 and im: 4.0]
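One way to picture a uniform self-identifying value is a small descriptor that carries the type, the size, and a pointer to the data. The sketch below is an assumption about how this could look in C; the names Value, TypeTag, ComplexData, make_complex, and real_part are illustrative and do not come from the slides.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical type tags; a real interpreter would have one per language type. */
typedef enum { TYPE_REAL, TYPE_COMPLEX } TypeTag;

/* Every value has the same shape: its type, its size, and a pointer to the data. */
typedef struct {
    TypeTag type;
    size_t size;     /* size of the data in bytes */
    void *data;      /* points to the actual representation */
} Value;

typedef struct { double re, im; } ComplexData;

/* Wraps an existing ComplexData in the uniform representation. */
Value make_complex(ComplexData *c) {
    assert(c != NULL);
    Value v = { TYPE_COMPLEX, sizeof *c, c };
    return v;
}

/* Reads the real part, checking the type tag at run time. */
double real_part(Value v) {
    assert(v.type == TYPE_COMPLEX);   /* the run-time check the tag makes possible */
    return ((ComplexData *)v.data)->re;
}
```

Because the tag travels with the value, the interpreter can detect a type error at the moment of use, which is part of why recursive interpreters are credited with excellent error detection.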

Status Indicator

• Directs the control flow of the interpreter
• Possible values
  – Normal mode
  – Errors
  – Jumps
  – Exceptions
  – Return

Example: Interpreting C Return

PROCEDURE Elaborate return with expression statement (RWE node):
    SET Result TO Evaluate expression (RWE node . expression);
    IF Status . mode /= Normal mode: RETURN;
    SET Status . mode TO Return mode;
    SET Status . value TO Result;


Interpreting If-Statement
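The status indicator can be sketched in C as follows. This is an illustration under simplifying assumptions, not the course code: conditions and return values are plain integers so that no expression evaluator is needed, and the names Status, Stmt, elaborate, and call are hypothetical. The point is that every step checks the mode, so a return inside an if-statement stops the statements that follow it.

```c
#include <assert.h>
#include <stddef.h>

typedef enum { NORMAL_MODE, ERROR_MODE, RETURN_MODE } Mode;

/* Hypothetical global status indicator, consulted before every statement. */
static struct { Mode mode; int value; } Status = { NORMAL_MODE, 0 };

typedef enum { STMT_RETURN, STMT_IF } StmtKind;

/* Hypothetical statement node. */
typedef struct Stmt {
    StmtKind kind;
    int value;                           /* return value, or condition of an if */
    struct Stmt *then_part, *else_part;  /* used by STMT_IF only; may be NULL */
    struct Stmt *next;                   /* next statement in the sequence */
} Stmt;

/* Elaborates a statement sequence, stopping when the mode leaves Normal. */
void elaborate(const Stmt *s) {
    for (; s != NULL && Status.mode == NORMAL_MODE; s = s->next) {
        switch (s->kind) {
        case STMT_RETURN:
            Status.mode = RETURN_MODE;   /* tells enclosing statements to stop */
            Status.value = s->value;
            break;
        case STMT_IF:
            elaborate(s->value ? s->then_part : s->else_part);
            break;
        }
    }
}

/* Runs a function body and returns its result, resetting the status. */
int call(const Stmt *body) {
    Status.mode = NORMAL_MODE;
    elaborate(body);
    assert(Status.mode == RETURN_MODE);  /* the sketch requires every path to return */
    Status.mode = NORMAL_MODE;
    return Status.value;
}
```

In a fuller interpreter the same check would also cover the error, jump, and exception modes listed on the Status Indicator slide.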

Symbol table

• Stores content of variables, named constants, …
• For every variable V of type T
  – A pointer to the name of V
  – The file name and the line where it is declared
  – Kind of declaration
  – A pointer to T
  – A pointer to newly allocated space
  – Initialization bit
  – Language dependent information (e.g. scope)
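The fields listed above map naturally onto a C struct. The following is a sketch under assumptions: the names SymbolEntry, Type, declare_variable, assign, and read_value are illustrative, and scope_level stands in for whatever language-dependent information a real interpreter keeps.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

typedef enum { KIND_VARIABLE, KIND_CONSTANT } DeclKind;

/* Hypothetical type descriptor; only the size is needed for this sketch. */
typedef struct { const char *name; size_t size; } Type;

typedef struct {
    const char *name;        /* pointer to the name of V */
    const char *file;        /* file in which V is declared */
    int line;                /* line on which V is declared */
    DeclKind kind;           /* kind of declaration */
    const Type *type;        /* pointer to T */
    void *storage;           /* pointer to newly allocated space */
    int initialized;         /* initialization bit */
    int scope_level;         /* example of language-dependent information */
} SymbolEntry;

/* Declares a variable: allocates zeroed storage of the type's size. */
SymbolEntry declare_variable(const char *name, const Type *type,
                             const char *file, int line, int scope_level) {
    SymbolEntry e = { name, file, line, KIND_VARIABLE, type,
                      calloc(1, type->size), 0, scope_level };
    assert(e.storage != NULL);
    return e;
}

/* Stores a value and sets the initialization bit. */
void assign(SymbolEntry *e, const void *value) {
    memcpy(e->storage, value, e->type->size);
    e->initialized = 1;
}

/* Reads a value; use before initialization is caught here. */
void read_value(const SymbolEntry *e, void *out) {
    assert(e->initialized);
    memcpy(out, e->storage, e->type->size);
}
```

The initialization bit is what lets an interpreter report a read of an uninitialized variable, something compiled code usually cannot do cheaply.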

Summary Recursive Interpreters

• Can be implemented quickly
  – Debug the programming language
• Not good for heavy-duty interpreters
  – Slow
  – Can employ general techniques to speed up the recursive interpreter
    • Memoization
    • Tail call elimination
    • Partial evaluation

Memoization

Before:

int fib(int n) {
    if (n == 0) return 0;
    if (n == 1) return 1;
    return fib(n-1) + fib(n-2);
}

After:

/* Cache of computed results; 0 means "not computed yet" (fib(n) > 0 for n >= 1). */
static int sfib[100];

/* Requires 0 <= n < 100; results beyond fib(46) overflow a 32-bit int. */
int fib(int n) {
    if (n == 0) return 0;
    if (n == 1) return 1;
    if (sfib[n] > 0) return sfib[n];
    sfib[n] = fib(n-1) + fib(n-2);
    return sfib[n];
}

Tail Call Elimination

Before:

void a(…) { … b(); }
void b() { code; }

After:

void a(…) { … code; }
void b() { code; }

Tail Call Elimination

Before:

void a(int n) {
    code;
    if (n > 0) a(n-1);
}

After:

void a(int n) {
loop:
    code;
    if (n > 0) { n = n - 1; goto loop; }
}
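As a concrete instance of the transformation, here is a small runnable sketch. The functions sum_rec and sum_loop are made up for illustration; they add the numbers 1..n to an accumulator, first with a tail call and then with the call replaced by a jump.

```c
#include <assert.h>

/* Recursive version: the call to sum_rec is the last action (a tail call). */
long long sum_rec(int n, long long acc) {
    assert(n >= 0);
    if (n == 0) return acc;
    return sum_rec(n - 1, acc + n);
}

/* After tail call elimination: the parameters are updated in place
   and the call becomes a jump back to the start of the body. */
long long sum_loop(int n, long long acc) {
    assert(n >= 0);
loop:
    if (n == 0) return acc;
    acc = acc + n;
    n = n - 1;
    goto loop;
}
```

The loop version uses constant stack space, so it stays safe for large n where the recursive version may exhaust the stack if the C compiler does not perform the optimization itself.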

Partial Evaluation

• Partially interpret static parts in a program
• Generates an equivalent program

[Figure: the Partial Evaluator takes Program and Input 1 and produces Program'; Program' then runs on Input 2.]

Example

int pow(int n, int e)
{
    if (e == 0)
        return 1;
    else
        return n * pow(n, e-1);
}

e=4

int pow4(int n)
{
    return n * n * n * n;
}
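To make the step from pow to pow4 tangible, the sketch below generates the specialized source text for a given exponent. It is a hand-written generator for this one function, not a general partial evaluator, and the name specialize_pow is an assumption. It unrolls the recursion once per level of e and ends with the constant 1 from the base case, so for e=4 it produces n * n * n * n * 1 rather than the simplified body shown on the slide.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Writes the C source of pow specialized to exponent e into buf.
   Returns 0 on success, -1 if e is negative or buf is too small. */
int specialize_pow(int e, char *buf, size_t size) {
    assert(buf != NULL);
    if (e < 0) return -1;
    int used = snprintf(buf, size, "int pow%d(int n) { return ", e);
    if (used < 0 || (size_t)used >= size) return -1;
    for (int i = 0; i < e; i++) {
        /* Unroll one level of the recursion: n * pow(n, e-1). */
        int w = snprintf(buf + used, size - (size_t)used, "n * ");
        if (w < 0 || (size_t)w >= size - (size_t)used) return -1;
        used += w;
    }
    /* The base case e == 0 contributes the constant 1. */
    int w = snprintf(buf + used, size - (size_t)used, "1; }");
    if (w < 0 || (size_t)w >= size - (size_t)used) return -1;
    return 0;
}
```

The static input (e) is consumed at generation time; only the dynamic input (n) remains in the generated program.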

Example2

Bool match(string, regexp)
{
    switch(regexp) {
    ….
    }
}

regexp=a b*

Partial Evaluation Generalizes Compilation

[Figure: the Partial Evaluator takes the Interpreter and the AST and produces a Program; that Program then runs on the Program Input.]


But ….

Iterative Interpretation

• Closer to CPU
• One flat loop with one big case statement
• Use explicit stack
  – Intermediate results
  – Local variables
• Requires fully annotated threaded AST
  – Active-node-pointer (interpreted node)

Demo Compiler

Threaded AST

• Annotated AST
• Every node is connected to the immediate successor in the execution
• Control flow graph
  – Nodes
    • Basic execution units
      – expressions
      – assignments
  – Edges
    • Transfer of control
      – sequential
      – while
      – …

Threaded AST for (2 * ((3*4)+9))

[Figure: the AST for (2 * ((3*4)+9)), with node fields type, oper, left, right, extended with threading arrows that begin at a Dummy_node labeled Start.]

C Example

while ((x > 0) && (x < 10))
{
    x = x + y ;
    y = y - 1 ;
}

[Figure: threaded AST of the loop. The while node has an and condition over (x > 0) and (x < 10) and a seq body holding the two assignments (ass). The condition has a T edge and an F edge; the F edge leads to exit.]

Threading the AST (3.2.1)

• One preorder AST pass
• Every type of AST node has its own threading routine
• Maintains Last node pointer
  – Global variable
• Set successor of Last pointer when node is visited
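The steps above can be sketched in C for the digit-expression AST used earlier. This is an illustration under assumptions, not the course code: the Node struct, thread_expression, and thread_ast are hypothetical names. In this sketch an operator node is linked after its operands, so that the thread follows evaluation order.

```c
#include <assert.h>
#include <stddef.h>

typedef struct Node {
    char type;                   /* 'D' digit, 'P' parenthesized expression */
    int value;
    char oper;
    struct Node *left, *right;
    struct Node *successor;      /* the thread: next node to execute */
} Node;

/* Global Last node pointer, as described on the slide. */
static Node *last_node;

/* Threading routine for expression nodes: operands first, then the node itself. */
static void thread_expression(Node *n) {
    if (n->type == 'P') {
        thread_expression(n->left);
        thread_expression(n->right);
    }
    last_node->successor = n;    /* set successor of Last when n is reached */
    last_node = n;
}

/* Threads a whole AST; start is a dummy node whose successor
   becomes the first node to execute. */
void thread_ast(Node *root, Node *start) {
    assert(root != NULL && start != NULL);
    last_node = start;
    thread_expression(root);
    last_node->successor = NULL; /* end of the thread */
}
```

For (2 * ((3*4)+9)) this links the nodes in the order 2, 3, 4, *, 9, +, *, which is the order an iterative interpreter needs.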

[Figure sequence: the AST of the while loop above is threaded step by step. The Last node pointer starts at main and advances one node per frame; a T edge appears in the later frames, and the final frame also marks the First node pointer.]


Demo Compiler

Conditional Statement

[Figure: threading an if node with the children cond, then_part, and else_part. The condition has a T edge and an F edge, an End_If node is added, and the Last node pointer is shown before and after threading.]

Iterative Interpretation

• Closer to CPU
• One flat loop with one big case statement
• Use explicit stack
  – Intermediate results
  – Local variables
• Requires fully annotated threaded AST
  – Active-node-pointer (interpreted node)
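A minimal sketch of such a loop for the digit-expression language follows. It assumes the AST has already been threaded, so each node only needs its successor; the Node layout, STACK_MAX, and interpret are illustrative names rather than the course's code.

```c
#include <assert.h>
#include <stddef.h>

/* Threaded node: operands are found on the stack, so no child pointers are needed. */
typedef struct Node {
    char type;                   /* 'D' digit, 'P' operator application */
    int value;                   /* digit value when type == 'D' */
    char oper;                   /* '+' or '*' when type == 'P' */
    struct Node *successor;      /* next node in execution order */
} Node;

#define STACK_MAX 64

int interpret(const Node *first) {
    int stack[STACK_MAX];        /* explicit stack of intermediate results */
    int sp = 0;                  /* number of values on the stack */
    const Node *active = first;  /* active-node pointer */
    while (active != NULL) {     /* one flat loop */
        switch (active->type) {  /* one big case statement */
        case 'D':
            assert(sp < STACK_MAX);
            stack[sp++] = active->value;
            break;
        case 'P': {
            assert(sp >= 2);
            int right = stack[--sp];
            int left = stack[--sp];
            stack[sp++] = (active->oper == '+') ? left + right : left * right;
            break;
        }
        default:
            assert(!"unknown node type");
        }
        active = active->successor;
    }
    assert(sp == 1);             /* a well-formed thread leaves exactly one result */
    return stack[0];
}
```

Compared with the recursive interpreter, the C call stack is no longer involved: the interpreter's state is just the active-node pointer and the explicit stack.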


Demo Compiler


Conditional Statements


Storing Threaded AST

• General Graph

• Array

• Pseudo Instructions

Threaded AST as General Graph

[Figure: the nodes condition, IF, statement 1, statement 2, statement 3, statement 4, and END If, linked by pointers.]

Threaded AST as Array

[Figure: the same nodes stored as array elements: condition, IF, statement 1, statement 2, statement 3, statement 4.]

Threaded AST as Pseudo Instructions

[Figure: the same code as a linear instruction sequence made of condition, IFFALSE, statement 1, statement 2, statement 3, statement 4, and JUMP.]
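The pseudo-instruction form can be sketched as an array of small instructions with explicit jump targets. Everything here is an assumption made for illustration: the opcodes OP_PUSH and OP_SET stand in for "condition" and "statement", and the layout of the then and else parts is one plausible arrangement, not necessarily the one in the figure.

```c
#include <assert.h>
#include <stddef.h>

typedef enum { OP_PUSH, OP_IFFALSE, OP_JUMP, OP_SET, OP_HALT } Opcode;

/* One pseudo instruction: an opcode plus an operand or a jump target. */
typedef struct { Opcode op; int arg; } Instr;

/* Runs the instruction array and returns the value left by the last OP_SET. */
int run(const Instr *code, size_t length) {
    int stack[16], sp = 0, result = 0;
    size_t pc = 0;                       /* index of the next instruction */
    assert(code != NULL);
    while (pc < length) {
        Instr in = code[pc++];
        switch (in.op) {
        case OP_PUSH:                    /* stands in for evaluating a condition */
            assert(sp < 16);
            stack[sp++] = in.arg;
            break;
        case OP_IFFALSE:                 /* pop; jump when the condition is false */
            assert(sp > 0);
            if (stack[--sp] == 0) pc = (size_t)in.arg;
            break;
        case OP_JUMP:                    /* unconditional jump */
            pc = (size_t)in.arg;
            break;
        case OP_SET:                     /* stands in for a statement */
            result = in.arg;
            break;
        case OP_HALT:
            return result;
        }
    }
    return result;
}
```

Because successors are implicit (the next array element) except at IFFALSE and JUMP, this form needs no successor pointer per node and is close to what a code generator would emit.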

Iterative Interpreters (Summary)

• Different AST representations
• Faster than recursive interpreters
  – Some interpretative overhead is eliminated
• Portable
• Secure
• Similarities with the compiler

Code Generation

• Transform the AST into machine code
• Machine instructions can be described by tree patterns
• Replace tree-nodes by machine instructions
  – Tree rewriting
  – Replace subtrees
• Applicable beyond compilers

a := (b[4*c+d]*2)+9

[Figure: tree patterns for the leal and movsbl instructions]

[Figure: expression tree for the assignment, with operands in registers]

Ra := (mem[@b + ((4 * Rc) + Rd)] * 2) + 9

[Figure: the mem subtree has been replaced by Rt, leaving Ra := (Rt * 2) + 9]

Load_Byte (b+Rd)[Rc], 4, Rt

[Figure: the remaining tree has been replaced, leaving only Ra]

Load_Byte (b+Rd)[Rc], 4, Rt
Load_address 9[Rt], 2, Ra
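The two instructions can be checked against the source expression with a small model in C. The reading of the operands is an assumption: Load_Byte (b+Rd)[Rc], 4, Rt is taken to load the byte at address b + Rd + 4*Rc, and Load_address 9[Rt], 2, Ra is taken to compute 9 + 2*Rt. The function names load_byte, load_address, and direct are illustrative.

```c
#include <assert.h>
#include <stddef.h>

/* Models Load_Byte (base+Rd)[Rc], scale, Rt as Rt = mem[base + Rd + scale*Rc]. */
int load_byte(const signed char *base, int rd, int rc, int scale) {
    assert(base != NULL);
    return base[rd + scale * rc];
}

/* Models Load_address disp[Rt], scale, Ra as Ra = disp + scale*Rt. */
int load_address(int disp, int rt, int scale) {
    return disp + scale * rt;
}

/* Direct evaluation of a := (b[4*c+d]*2)+9, for comparison. */
int direct(const signed char *b, int c, int d) {
    assert(b != NULL);
    return (b[4 * c + d] * 2) + 9;
}
```

With c = 1 and d = 2 the byte at index 6 is loaded; if it holds 5, both the two-instruction sequence and the direct evaluation yield 19. The address-computation instruction is doing ordinary arithmetic here, which is the kind of reuse that tree-pattern matching can discover.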


Code generation issues

• Code selection

• Register allocation

• Instruction ordering

Simplifications

• Consider small parts of the AST at a time
• Simplify the target machine
• Use simplifying conventions


Overall Structure