A PCP Characterization of AM

40
A PCP Characterization of AM Andrew Drucker MIT July 7, 2011 Andrew Drucker MIT, A PCP Characterization of AM 1/40

Transcript of A PCP Characterization of AM

Page 1: A PCP Characterization of AM

A PCP Characterization of AM

Andrew DruckerMIT

July 7, 2011

Andrew Drucker MIT, A PCP Characterization of AM 1/40

Page 2: A PCP Characterization of AM

The class AM

I Arthur-Merlin (AM) protocols: a generalization of NPprotocols [Babai-Moran ’88]

I Explores the power of randomness in interaction with a prover.

Andrew Drucker MIT, A PCP Characterization of AM 2/40

Page 3: A PCP Characterization of AM

The class AM

I L ∈ AM if there exists a polynomial-time algorithmM(x , r ,w), with

|r |, |w | ≤ poly(|x |) ,such that:

1. x ∈ L ⇒ ∀r ∃w : M(x , r ,w) = 1;

2. x /∈ L ⇒ Prr [ ∃w : M(x , r ,w) = 1] ≤ 1/3.

I r = “random challenge”; w = “witness”.

Andrew Drucker MIT, A PCP Characterization of AM 3/40

Page 4: A PCP Characterization of AM

The class AM

I (ΠY ,ΠN) ∈ AM if there exists a polynomial-time algorithmM(x , r ,w), with

|r |, |w | ≤ poly(|x |) ,such that:

1. x ∈ ΠY ⇒ ∀r ∃w : M(x , r ,w) = 1;

2. x ∈ ΠN ⇒ Prr [ ∃w : M(x , r ,w) = 1] ≤ 1/3.

I r = “random challenge”; w = “witness”.

Andrew Drucker MIT, A PCP Characterization of AM 4/40

Page 5: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 5/40

Page 6: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 6/40

Page 7: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 7/40

Page 8: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 8/40

Page 9: A PCP Characterization of AM

AM vs. NP

I Clearly AM ⊇ NP.

I Is AM = NP? Is AM ⊆ NSUBEXP?

Andrew Drucker MIT, A PCP Characterization of AM 9/40

Page 10: A PCP Characterization of AM

“Hardness vs. randomness”

I “Hardness vs. randomness” paradigm: sufficiently strongcircuit lower bounds for exponential-time classes implynontrivial derandomization of AM, even up to AM = NP.

[Miltersen, Vinodchandran ’99; Shaltiel, Umans ’09]

I Gives a plausible reason to believe that AM = NP;

I But, not a currently viable approach to actually prove it!

Andrew Drucker MIT, A PCP Characterization of AM 10/40

Page 11: A PCP Characterization of AM

“Hardness vs. randomness”

I Alternative approaches to AM vs NP?

I (Caution: any proof of AM = NP will imply some new circuitlower bounds;but weaker than those needed for hardness vs. randomnessapproach.)

Andrew Drucker MIT, A PCP Characterization of AM 11/40

Page 12: A PCP Characterization of AM

Another approach

1. Identify “easiest” AM-hard problems;

2. Attack them with new algorithmic ideas.

I This work:

gives candidate for (1), based on PCPs;

shows obstacles to one algorithmic approach.

Andrew Drucker MIT, A PCP Characterization of AM 12/40

Page 13: A PCP Characterization of AM

PCPs for AM

I We give a “PCP characterization of AM”:

I For every L ∈ AM, there’s an AM protocol for L in whichArthur looks at only O(1) bits of the witness string, and O(1)bits of the random challenge!

Andrew Drucker MIT, A PCP Characterization of AM 13/40

Page 14: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 14/40

Page 15: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 15/40

Page 16: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 16/40

Page 17: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 17/40

Page 18: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 18/40

Page 19: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 19/40

Page 20: A PCP Characterization of AM

Andrew Drucker MIT, A PCP Characterization of AM 20/40

Page 21: A PCP Characterization of AM

Related work

I Idea of giving PCP-based complete problems for complexityclasses other than NP is not new.

I Similar analogues of PCP Theorem given for:

1. PH (the Polynomial Hierarchy)[Ko, Lin ’94], [Haviv, Regev, Ta-Shma ’07]

2. PSPACE[Condon, Feigenbaum, Lund, Shor ’95], [Drucker ’11]

3. IP = PSPACE[CFLS ’97]

Andrew Drucker MIT, A PCP Characterization of AM 21/40

Page 22: A PCP Characterization of AM

Switching views

I PCP Theorem can be described in terms of proof systems, orin terms of Constraint Satisfaction Problems (CSPs).

I Similarly with our result. We’ll work with CSP viewpoint.

Andrew Drucker MIT, A PCP Characterization of AM 22/40

Page 23: A PCP Characterization of AM

Stochastic CSPs

I k-CSPs: a family ψ1(z), . . . , ψm(z) of constraints on variablesz : each ψi is k-local.

Let Valψ(z) = fraction of constraints ψi satisfied by z .

I Stochastic CSPs: ψ(r , z)

r = “random challenge” variables;

z = “witness/response” variables.

Andrew Drucker MIT, A PCP Characterization of AM 23/40

Page 24: A PCP Characterization of AM

A complete problem for AMI Say that ψ(r , z) is risk-free if

∀r ∃z : Valψ(r , z) = 1 .

I Say that ψ(r , z) is ε-risky if with probability ≥ 2/3 overuniform r ,

∀z : Valψ(r , z) < 1− ε .We show:

Theorem 1There is an ε > 0 and a constant-size alphabet Σ such that,for stochastic 2-CSPs over Σ, it is AM-complete to distinguishbetween the cases

1. ψ(r , z) is risk-free;

2. ψ(r , z) is ε-risky.

Call this promise problem Gap− Stoch− 2CSPΣ,ε.Andrew Drucker MIT, A PCP Characterization of AM 24/40

Page 25: A PCP Characterization of AM

Sketch of the proof

I Easy to see that the problem is in AM. Nontrivial direction:show it’s AM-hard.

I Will show how to reduce any L ∈ AM toGap− Stoch− 2CSPΣ,ε.

(Promise problems Π ∈ AM handled same way.)

I Given: an AM protocol M(x , r ,w) for a language L ∈ AM.

Andrew Drucker MIT, A PCP Characterization of AM 25/40

Page 26: A PCP Characterization of AM

Sketch of the proof

I Step 1: improve the soundness guarantee of M.

Initial soundness = 1/3.

I Can drive down soundness to (1/3)k by k-fold parallelrepetition of M; but, blows up |r | unacceptably.

I Instead, use randomness-efficient soundness amplification of[Bellare, Goldreich, Goldwasser ’93]. Gives a new protocolM ′ for L, such that

x /∈ L =⇒ Prr

[∃w : M ′(x , r ,w) = 1] ≤ 2−Ω(|r |) .

Andrew Drucker MIT, A PCP Characterization of AM 26/40

Page 27: A PCP Characterization of AM

Sketch of the proof

I Assume for simplicity that in M ′, we have |r | ≥ |w |. (Canremove this assumption.)

I Let Cx(r ,w) := M ′(x , r ,w).

Cx implementable by a poly(n)-sized circuit.

I Then, rephrasing:

x /∈ L =⇒ w.h.p. over r ,

(r ,w) is Ω(1)−far in relative distance from C−1x (1), for all w .

Andrew Drucker MIT, A PCP Characterization of AM 27/40

Page 28: A PCP Characterization of AM

Sketch of the proof

I Step 2: Transform Cx(r ,w) into probabilistically checkableformat.Key tool: Prob. checkable proofs of proximity (PCPPs)[Dinur, Reingold ’04; Ben-Sasson et al. ’04]

Theorem (Dinur ’06)

There is a polytime transformation mapping a circuit C (Y ) to a2-CSP ψ(Y , z) over a constant-sized alphabet, such that for all y :

1. C (y) = 1 =⇒ ∃z : Valψ(y , z) = 1;

2. If y is δ-far from C−1(1), then ∀z : Valψ(y , z) < 1− Ω(δ).

Andrew Drucker MIT, A PCP Characterization of AM 28/40

Page 29: A PCP Characterization of AM

Sketch of the proof

I Let ψCx = ψCx (r ,w , z) be the output of Dinur’s reduction,applied to Cx .Let r be the random challenge vars; (w , z) witness-variables.

I Easy to check that x 7−→ ψCx is the reduction we are lookingfor:

1. x ∈ L =⇒ ψCx is risk-free;

2. x /∈ L =⇒ ψCx is Ω(1)-risky.

I This proves Gap− Stoch− CSPΣ,ε is AM-hard

(for small ε > 0).

Andrew Drucker MIT, A PCP Characterization of AM 29/40

Page 30: A PCP Characterization of AM

What next?

I Nontrivial derandomization for our AM-complete promiseproblem?

...haven’t found one.

I How might we try?

Andrew Drucker MIT, A PCP Characterization of AM 30/40

Page 31: A PCP Characterization of AM

What next?

I For a stochastic 2-CSP ψ(r , z), what is the complexity ofapproximately optimizing over z , for randomly selected r?

I Perhaps easy, if we allow algorithm to depend nonuniformlyon ψ...

Andrew Drucker MIT, A PCP Characterization of AM 31/40

Page 32: A PCP Characterization of AM

A “randomized optimization” hypothesis

Hypothesis A

For any fixed δ, ε > 0, and any stochastic 2-CSP ψ(r , z) of size n,there is an “optimizer” circuit OPTψ(r), of size polyδ,ε(n) over r ,such that with prob. 1− δ,

Valψ (r ,OPTψ(r)) ≥ maxz

(Valψ (r , z))− ε .

Andrew Drucker MIT, A PCP Characterization of AM 32/40

Page 33: A PCP Characterization of AM

A “randomized optimization” hypothesis

ClaimHypothesis A implies AM = MA.

I Proof of Claim uses our AM-completeness result.

I If the optimizer circuits in Hyp. A can be NC0 circuits, we’dget the stronger conclusion AM = NP.

Andrew Drucker MIT, A PCP Characterization of AM 33/40

Page 34: A PCP Characterization of AM

Evidence against the hypothesis

I But—if NP is sufficiently hard, our plan fails:

Theorem 2Suppose some L ∈ NP is 2/3-hard on average for circuits of size2Ω(n).

Then, Hyp A fails.

Andrew Drucker MIT, A PCP Characterization of AM 34/40

Page 35: A PCP Characterization of AM

Proof sketch for Theorem 2

I Step 1: Our hardness assumption for L =⇒∃ a poly-time predicate M(r ,w) such that:

1. M(r , ·) is satisfiable w.h.p.; but,

2. For any poly-sized circuit C (r),

Prr

[M(r ,C (r)) = 1] = 2−Ω(|r |) (tiny) .

I Assume for simplicity: L balanced: |Ln| = 2n−1.

Andrew Drucker MIT, A PCP Characterization of AM 35/40

Page 36: A PCP Characterization of AM

Proof sketch for Theorem 2

I Natural idea for M(r ,w):

r consists of many independent random strings of length n;M(r ,w) accepts iff w supplies proofs that at least a .49

fraction of them lie in L.

I M(r , ·) is satisfiable w.h.p.—by Chernoff bounds!

I Any poly-sized circuit fails to satisfy M(r , ·):

Follows from hardness assumption on L and Direct Producttheorems.

I Problem: uses too much randomness.

Andrew Drucker MIT, A PCP Characterization of AM 36/40

Page 37: A PCP Characterization of AM

Proof sketch for Theorem 2

I Solution: use Impagliazzo-Wigderson PRG [IW ’97].

I To prove concentration property needed, apply recent StrongChernoff Bound for Expander Walks[Wigderson, Xiao ’05], [WX ’08], [Healy ’08].

Andrew Drucker MIT, A PCP Characterization of AM 37/40

Page 38: A PCP Characterization of AM

Proof sketch for Theorem 2

I Step 2: convert our predicate M(r ,w) into prob. checkableform.

I Idea: use PCPPs + error-correcting codes.

I This proves Theorem 2.

Andrew Drucker MIT, A PCP Characterization of AM 38/40

Page 39: A PCP Characterization of AM

Summary

I Gave a new AM-complete problem, perhaps the “easiest”known.

I Advocated searching for an algorithmic attack on this problemto derandomize AM.

I Found obstacles to one natural approach.

Andrew Drucker MIT, A PCP Characterization of AM 39/40

Page 40: A PCP Characterization of AM

Open Problems

I Complexity of Gap− Stoch− 2CSPΣ,ε when each “random”variable in ψ(r , z) appears only O(1) times in ψ1, . . . , ψm?

I Better hardness-vs-randomness results usingGap− Stoch− 2CSPΣ,ε?

I New upper bounds on the power of AM protocols?

Andrew Drucker MIT, A PCP Characterization of AM 40/40