ESC Java. Static Analysis Spectrum Power Cost Type checking Data-flow analysis Model checking...

ESC Java

Static Analysis Spectrum

CostType

checking

Data-flow

analysis

Modelchecking

Programverification

Automated Manual

Is This Program Correct?

int square(int n) {

int k = 0, r = 0, s = 1;

while(k != n) {

r = r + s; s = s + 2; k = k + 1;

return r;

• Type checking not enough to check this– Neither is data-flow analysis, nor model checking

Program Verification

• Program verification is the most powerful static analysis method– Can reason about all properties of programs

• Cannot fully automate

• But …– Can automate certain parts (ESC/Java)– Teaches how to reason about programs in a

systematic way

Specifying Programs

• Before we check a program we must specify what it does

• We need formal specifications– English comments are not enough

• We use logic notation– Theory of pre- and post-conditions

State Predicates

• A predicate is a boolean expression on the program state (e.g., variables, object fields)

• Examples:– x == 8– x < y– true– false– (8i. 0 <= i < a.length ) a[i] >= 0)

Using Predicates to Specify Programs

• We focus first on how to specify a statement

• Hoare triple for statement S

{ P } S { Q }

• Says that if S is started in a state that satisfies P, and S terminates, then it terminates in Q– This is the liberal version, which doesn’t care about

termination– Strict version: if S is started in a state that satisfies P

then S terminates in Q

precondition postcondition

Hoare Triples. Examples.

• { true } x = 12 { x == 12 }

• { y >= 0 } x = 12 { x == 12 }

• { true } x = 12 { x >= 0 } (Programs satisfy many possible specifications)

• { x < 10 } x = x + 1 { x < 11 }

• { n >= 0 } x = fact(n) { x == n ! }

• { true } a = 0; if(x != 0) { a = 2 * x; } { a == 2*x }

Computing Hoare Triples

• We compute the triples using rules– One rule for each statement kind

– Rules for composed statements

Assignment

• Assignment is the simplest operation and the trickiest one to reason about !

• { y >= 2 } x = 5 { ? }

• { x == y } x = x + 1 { ? }

• { ? } x = 5 { x == y }

• { ? } x = x + 1 { x == y }

• { ? } x = x + 1 { x2 + y2 == z2 }

• { x2 + y2 == z2 } x = x + 1 { ? }

Assignment Rule

• Rule for assignment

{ Q[x := E] } x = E { Q}

• Examples: – { 12 == 12 } x = 12 { x == 12 }

– { 12 >= 0 } x = 12 { x >= 0 }– { ? } x = x + 1 { x >= 0 }– { x >= 1 } x = x + 1 { ? }

Q with x replaced by E

x == 12 with x replaced by 12

Relaxing Specifications

• Consider { x >= 1 } x = x + 1 { x >= 2 }– It is very tight specification. We can relax it

• Example: { x >= 5 } x = x + 1 { x >= 2 } (since x >= 5 ) x + 1 >= 2)

{ P } if P ) Q[x:=E]

Assignments: forward and backward

• Two ways to look at the rules:– Backward: given post-condition, what is pre-

condition?

– Forward: given pre-condition, what is post-condition?

{ ??? }

condition?

{ Q [ x := E] }

{ ??? }

condition?

{ Q [ x := E] }

Example of running it forward

• { x == y } x = x + 1 { ? }

Example of running it forward

• { x == y } x = x + 1 { ? }

Forward or Backward

• Forward reasoning– Know the precondition– Want to know what postconditon the code establishes

• Backward reasoning– Know what we want to code to establish– Must find in what precondition this happens

• Backward is used most often– Start with what you want to verify– Instead of verifying everything the code does

Weakest precondition

• wp(S, Q) is the weakest P such that { P } S { Q } – Order on predicates: Strong ) Weak– wp returns the “best” possible predicate

• wp(x := E, Q) = Q[x := E]

• In general:

{ P } if P ) wp(S,Q)

Weakest precondition

• This points to a verification algorithm:– Given function body annotated with pre-condition P

and post-condition Q:• Compute wp of Q with respect to functon body• Ask a theorem prover to show that P implies the wp

• The wp function we will use is liberal (P does not guarantee termination)– If using both strict and liberal in the same context, the

usual notation is wlp the liberal version and wp for the strict one

Strongest postcondition

• sp(S, P) is the strongest Q such that { P } S { Q } – Recall: Strong ) Weak– sp returns the “best” possible predicate

• sp(x := E, P) = …

• In general:

{ Q } if sp(S,P) ) Q

Strongest postcondition

• Strongest postcondition and weakest preconditions are symmetric

• This points to an equivalent verification algorithm:– Given function body annotated with pre-condition P

and post-condition Q:• Compute sp of P with respect to functon body• Ask a theorem prover to show that the sp implies Q

Composing Specifications

• If { P } S1 { R } and { R } S2 { Q}

then { P } S1; S2 { Q }

• Example:

x = x - 1;

y = y - 1

{ x >= y }

Composing Specifications

• If { P } S1 { R } and { R } S2 { Q}

then { P } S1; S2 { Q }

• Example:

x = x - 1;

y = y - 1

{ x >= y }

In terms of wp and sp

• wp(S1; S2 , Q) = wp(S1,wp(S2, Q))

• sp(S1; S2, P) = sp(S2,sp(S1, P))

Conditionals

• Rule for the conditional (flow graph)

• Example:

{ P1 }if P && E ) P1

{ P2 }if P && ! E ) P2

x == 0

{ x >= 0 }T F

{ x == 0 }since x >= 0 && x == 0 ) x == 0

{ x >= 1 }since x >= 0 && x != 0 ) x >= 1

Conditionals: Forward and Backward

• Recall: rule for the conditional

• Forward: given P, find P1 and P2

– pick P1 to be P && E, and P2 to be P && ! E

• Backward: given P1 and P2, find P

– pick P to be (P1 && E) || (P2 && ! E)

– Or pick P to be (E ) P1) && (! E ) P2)

{ P1 }provided P && E ) P1

{ P2 }provided P && ! E ) P2

• Rule for the join:

• Forward: pick P to be P1 || P2

• Backward: pick P1, P2 to be P

{ P1 } { P2 }

{ P } provided P1 ) P and P2 ) P

Review

{ P1 }if P && E ) P1

{ P2 }if P && ! E ) P2

{ P1 } { P2 }

if P1 ) P and P2 ) P

if P ) Q[x:=E]

Implication is always in the direction of the control flow

Review: forward

{ P && E }

{ P && ! E }

{ P1 } { P2 }

{ P1 || P2 }x = E

{ \exists … }

Review: backward

{(E ) P1) && (! E ) P2) }

{ P1 }

{ P2 }

{ P } { P }

{ P }x = E

{Q[x:=E] }

Example: Absolute value

static int abs(int x)//@ ensures \result >= 0{ if (x < 0) { x = -x; } if (c > 0) { c--; } return x;}

x = -x

In Simplify

> (IMPLIES TRUE (AND (IMPLIES (< x 0) (AND (IMPLIES (> c 0) (>= (- 0 x) 0)) (IMPLIES (<= c 0) (>= (- 0 x) 0)))) (IMPLIES (>= x 0) (AND (IMPLIES (> c 0) (>= x 0)) (IMPLIES (<= c 0) (>= x 0))))))1: Valid.

So far…

• Framework for checking pre and post conditions of computations without loops

• Suppose we want to check that some condition holds inside the computation, rather than at the end

static int abs(int x) { if (x < 0) { x = -x; } if (c > 0) { c--; } return x;}

Say we want to check that x > 0 here

Asserts

• { Q && E } assert(E) { Q }

• Backward: wp(assert(E), Q) = Q && E

• Forward: sp(assert(E), P) = ???

assert(E)

Q && E

assert(E)

Example: Absolute value with assert

static int abs(int x){ if (x < 0) { x = -x; assert(x > 0); } if (c > 0) { c--; } return x;}

x = -x

assert(x > 0)

x = -x

assert(x > 0)

x = -x

assert(x > 0)

Adding the postcondition back in

x = -x

assert(x > 0)

Adding the postcondition back in

x = -x

assert(x > 0)

Another Example: Double Locking

“An attempt to re-acquire an acquired lock or release a released lock will cause a deadlock.”

Calls to lock and unlock must alternate.

unlock

Locking Rules

• We assume that the boolean predicate locked says if the lock is held or not

• { ! locked && P[locked := true] } lock { P }– lock behaves as assert(! locked); locked = true

• { locked && P[locked := false] } unlock { P }– unlock behaves as assert(locked); locked = false

Locking Example

unlock

{ ! L && P[L := true] } lock { P }

{ L && P[L := false] } unlock { P }

{ ! L }

Locking Example

unlock

{ ! L && P[L := true] } lock { P }

{ L && P[L := false] } unlock { P }

{ ! L }

Locking Example: forward direction

unlock

{ ! locked }

{ ! locked && x == 0 }{ ! locked && x 0 }

{ ! locked && x == 0 }

{ locked && x == 0 }

{ locked = (x == 0) }

{ locked && x == 0 }

{ ! locked && (x == 0) }

{ ! locked && x 0 }

{ ! locked }

In Simplify

(DEFPRED (isLocked L) (EQ L locked))

(IMPLIES (NOT (isLocked L)) (AND (IMPLIES (EQ x 0) (AND (NOT (isLocked L)) (AND (IMPLIES (EQ x 0) (AND TRUE (NOT FALSE))) (IMPLIES (NEQ x 0) (NOT (AND TRUE (NOT FALSE))))))) (IMPLIES (NEQ x 0) (AND (IMPLIES (EQ x 0) (AND (isLocked L) (NOT FALSE))) (IMPLIES (NEQ x 0) (NOT (AND (isLocked L) (NOT FALSE))))))))

1: Valid.

Reasoning About Programs with Loops

• Loops can be handled using conditionals and joins

• Consider the while(E) S statement

T FS { Q }

{ I && E }

if (1) P ) I (loop invariant holds initially)

and (2) I && ! E ) Q (loop establishes the postcondition)

and (3) { I && E } S { I } (loop invariant is preserved)

Loop invariant

Loops in the backward direction

• Given Q, want to find weakest invariant I that will establish (2) and (3), then pick P to be I

• Finding weakest I is:– Undecidable in theory– Hard in practice

T FS { Q }

{ I && E }

Loop invariant

Loops in the forward direction

• Given P, want to find strongest invariant I that will establish (1) and (3), then pick Q to be I && E

• Again, finding I is hard

T FS { Q }

{ I && E }

Loop invariant

Loop Example

• Let’s verify { x ==8 && y ==16 } while(x > 0) { x --; y -= 2; } { y==0 }– Is this true ?

– We must find an appropriate invariant I• Try one that holds initially x == 8 && y == 16• Try one that holds at the end y == 0

{ x == 8 && y == 16 }

T Fx --;

y -= 2 { y == 0 }

{ I && x > 0 }

Loop Example (II)

• Guess the invariant y == 2*x

• Must check– Initial: x == 8 && y == 16 ) y == 2*x– Preservation: y == 2*x && x > 0 ) y – 2 = 2*(x – 1) – Final: y == 2*x && x <= 0 ) y == 0

{ x == 8 && y == 16 }

T Fx --;

y -= 2 { y == 0 }

{ y == 2*x }

{ y == 2*x && x > 0 }

Loop Example (III)

• Guess the invariant y == 2*x && x >= 0

• Must check– Initial: x == 8 && y == 16 ) y == 2*x && x >= 0– Preservation: y == 2*x && x >= 0 && x > 0 ) y – 2 = 2*(x – 1) && x – 1 >= 0– Final: y == 2*x && x >= 0 && x <= 0 ) y == 0

{ x == 8 && y == 16 }

T Fx --;y -= 2 { y == 0 }

{ y == 2*x && x >= 0}

{ y == 2*x && x >= 0 && x > 0 }

Functions

• Consider a binary search function bsearchint bsearch(int a[], int p) {

{ sorted(a) }

{ r == -1 || (r >= 0 && r < a.length && a[r] == p) }

return res;

• The precondition and postconditon are the function specification– Also called a contract

Postcondition

Precondition

Function Calls

• Consider a call to function F(int in)– With return variable out– With precondition Pre, postcondition Post

• Rule for function call:

y = F(E)

{ P } if P ) Pre[in:=E] }

{ Q } and Post[out := y, in := E] ) Q

Function Call Example

• Consider the call{ sorted(array) }

y = bsearch(array, 5)

if( y != -1) {

{ array[ y ] == 5 }

• Show Post[r := y, a := array, p := 5]

) array[y] == 5– Need to know y != -1 !

• Show sorted[array] ) Pre[a := array]

Function Calls: backward

y = F(E)

Function Calls: backward

y = F(E)

Pointers and aliasing

x = *y + 1

{ ??? }

{ x == 5 }

Pointers and aliasing

x = *y + 1

{ *y == 4 }

{ x == 5 }

Regular rule worked in this case!

Example where regular rule doesn’t work

x = *y + 1

{ ??? }

{ x == *y + 1 }

x = *y + 1

{ y != &x Æ x == *y + 1 }

{ x == *y + 1 }

Pointer stores

*x = y + 1

{ ??? }

{ y == 5 }

Pointer stores

*x = y + 1

{ (x == &y ) y + 1 == 5) Æ (x != &y ) y == 5) }

{ y == 5 }

One solution

• Perform case analysis based on all the possible alias relationships between the LHS of the assignment and part of the postcondition

• Can use a static pointer analysis to prune some cases out

• However, exponentially many cases in the pointer analysis, which leads to large formulas.

• eg, how many cases here: *x = *y + a

{ *z == *v + b }

Another solution

• Up until now the program state has been implicit. Let’s make the program state explicit...

• A predicate is a function from program states to booleans.

• So for wp(S, Q), we have:– Q() returns true if Q holds in – wp(S, Q)() returns true if wp(S, Q) holds in

New formulation of wp

• Suppose step(S, ) returns the program state resuling from executing S starting in program state .

• Then we can express wp as follows:

wp(S, Q)() =

New formulation of wp

• Suppose step(S, ) returns the program state resuling from executing S starting in program state .

• Then we can express wp as follows:

wp(S, Q)() = Q(step(S, ))

Example in Simplify syntax

From previous slide: wp(S, Q)() = Q(step(S, ))

*x = y + 1

{ y == 5 }

step(S, ) is:

wp(S, Q) is:

Example in Simplify syntax

From previous slide: wp(S, Q)() = Q(step(S, ))

*x = y + 1

{ y == 5 }

Q is: (EQ (select s y) 5)

step(S, ) is: (store s (select s x) (+ (select s y) 1))

wp(S, Q) is: (EQ (select (store s (select s x) (+ (select s y) 1)) y) 5)

ESC/Java summary

• Very general verification framework– Based on pre- and post-conditions

• Generate VC from code– Instead of modelling the semantics of the code inside

the theorem prover

• Loops and procedures require user annotations– But can try to infer these

ESC Java. Static Analysis Spectrum Power Cost Type checking Data-flow analysis Model checking...

Documents

Transcript of ESC Java. Static Analysis Spectrum Power Cost Type checking Data-flow analysis Model checking...

Checking the Boundaries of Static Analysis

BUSINESS CHECKING ACCOUNTS - Chase...for additional banking services may be billed to your Chase Analysis Business . Checking, Chase Platinum Business Checking or Chase Performance

Analysis of students schedules and checking unusual study plans ...

Modern PL/SQL Code Checking and Dependency Analysis

Semantic Analysis (Symbol Table and Type Checking)

Experiences using static analysis & model checking for bug finding

Architectural capability analysis using a model-checking ... · Architectural capability analysis using a model-checking technique ... RESUMEN: Este trabajo describe un enfoque matemático

Fissile Type Analysis: Modular Checking of Almost ... · Fissile Type Analysis: Modular Checking of Almost Everywhere Invariants University of Colorado Boulder Devin Coughlin Bor-Yuh

24: Model Checking & Reachability Analysissymbolaris.com/course/fcps16/reachability.pdf · 24: Model Checking & Reachability Analysis ... editors, Handbook of Model Checking. Springer,

ESPAÇOS PARA ALUGUEL: 4 ESPAÇOS HALL SOC. ESC. LAJE …€¦ · corte bb - edifÍcio 02 escala ..... 1/100 esc. esc. esc. esc. esc. esc. esc. esc. esc. esc. esc. esc. esc. esc.

Semantic Analysis: Type Checking - Computer Science · Semantic Analysis: Type Checking April 22, 2013 Monday, April 22, 13. Where We Are Lexical Analysis Semantic Analysis Syntax

Declarative Semantics Definition - Static Analysis and Error Checking

Header Space Analysis: Static Checking For Networkscseweb.ucsd.edu/~varghese/PAPERS/headerspace.pdf · Header Space Analysis: Static Checking For Networks Peyman Kazemian Stanford

Semantic Analysis II Type Checking

Failure Analysis of an E-commerce Protocol using Model Checking

Beyond Fact-Checking: Network Analysis Tools for Monitoring Disinformation in Social Media · 2020. 4. 25. · Beyond Fact-Checking: Network Analysis Tools for Monitoring Disinformation

Front-end syntax analysis + type-checking + symbol table

ENG.20080828.0016 esc Design Calculation or Analysis Cover ...

Syntax Analysis - · PDF fileSyntax Analysis 1. Lexical Analyzer Parser ... – For static semantics checking, e.g. type checking of ... Chomsky Hierarchy: Language

Performance analysis of fact-checking organizations and ...