Spectral Graph Theory and Graph CNN - qdata.github.io€¦ · Reference 1 Laplacian Operator -...
Transcript of Spectral Graph Theory and Graph CNN - qdata.github.io€¦ · Reference 1 Laplacian Operator -...
Spectral Graph Theory and Graph CNN
https://qdata.github.io/deep2Read
Presenter : Ji Gao
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 1 / 28
Outline
1 Graph LaplacianDefinitionsWhy Laplacian?Graph Fourier Transform
2 Spectral Neural Network
3 Fast Spectral Filtering
4 Reference
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 2 / 28
Outline
1 Graph LaplacianDefinitionsWhy Laplacian?Graph Fourier Transform
2 Spectral Neural Network
3 Fast Spectral Filtering
4 Reference
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 3 / 28
Graph
Graph
A graph G = (V ,E ), where V = 1, 2..N is the set of Vertices andE ⊆ V × V .
(Vertex) Weighted Graph
A weighted graph G = (V ,E ,W ), where V = 1, 2..N is the set ofVertices, E ⊆ V × V , W : V → R.
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 4 / 28
Graph
Degree
The degree d(v) of a vertex v is the number of vertices in G that areadjacent to v .
Adjacency Matrix
Adjacency matrix A of the graph G is a n × n matrix that
Aij =
1 (i , j) ∈ E
0 Otherwise
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 5 / 28
Graph Laplacian
(Unnormalized) Graph Laplacian
Graph Laplacian L = diag(d)− A, which
Lij =
di i = j
−1 i 6= j&(i , j) ∈ E
0 Otherwise
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 6 / 28
Outline
1 Graph LaplacianDefinitionsWhy Laplacian?Graph Fourier Transform
2 Spectral Neural Network
3 Fast Spectral Filtering
4 Reference
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 7 / 28
Why Laplacian?
Laplacian
For function f, Laplacian operator ∆f = ∇ · ∇f
Laplacian represents the divergence of the gradient.
It’s a coordinate-free operator!
In physics, if a electromagnetic field is defined by a electrostaticpotential function φ, then ∆φ gives the charge distribution in thefield.
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 8 / 28
Eigenfunction of Laplacian Operator
Eigenfunction of Laplacian in (0, 1)
Suppose f is the eigenfunction of the Laplacian:
∆f + λf = 0, f (0) = f (1) = 0
∆f =∂2f
∂x2= −λf
The only non-trivial solution of the Laplacian is
fn(x) = C sin(nπx), n ∈ N
fn is the Fourier sine series.
fn together forms an orthonormal basis of the space L2(0, 1)
Theorem: For any L2(Ω) space where Ω is a reasonably smoothdomain, there exists an orthonormal family of eigenfunctions of ∆that forms an orthonormal basis of the space.
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 9 / 28
Outline
1 Graph LaplacianDefinitionsWhy Laplacian?Graph Fourier Transform
2 Spectral Neural Network
3 Fast Spectral Filtering
4 Reference
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 10 / 28
Graph Laplacian Revisited
Graph Laplacian
Graph Laplacian L = D − A
Suppose f is a function from vertex to R.
f can be represented by a vector (f1, f2...fn) with size n.
Therefore, [Lf ]i = di −∑
j Aij fj =∑
j Aij(fi − fj)
Calculating the difference on the value of a vertex to its neighbors!
f TLf =∑
<i ,j>∈E(fi − fj)
2
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 11 / 28
Graph Laplacian Revisited
Graph Laplacian
L = D − A
f TLf =∑
<i ,j>∈E(fi − fj)
2
Symmetric real matrix −→ Real eigenvalues
Positive semidefinite −→ Non-negative eigenvalues
First eigenvalue is 0 with eigenvector 1, 1, 1...10 = λ1 ≤ λ2 ≤ ... ≤ λn
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 12 / 28
Graph Fourier transform
The eigenvector of graph Laplacian matrix can be used as aorthonormal basis of the Hilbert space.
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 13 / 28
Graph Spectral Filtering
Filters can be used to form a convolutional layer
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 14 / 28
Spectral Networks and Deep Locally Connected Networkson graphsJoan Bruna, Wojciech Zaremba, Arthur Szlam, Yann Lecun
CNN is powerful. Extend CNN to general graphs.
1. Use hierarchical clustering
2. Use spectrum of graph laplacian to learn convolutional layers
Efficient: Number of parameters is independent of input size
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 15 / 28
Spatial CNN use Hierarchical clustering
Form a multi-scale clustering
The k-th layer has dk clusters
The k-th layer has fk filters
Convolutional Layer
For j = 1..fk ,
xk+1,j = Lkh(
fk−1∑i=1
Fk,i ,jxk,i )
.Fk , i , j is a dk−1 × dk−1 sparse matrix.Lk is a pooling operation.
Clusters are pre-defined by hierarchical clustering.
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 16 / 28
Spectral Construction
Spectral Convolution
Suppose V is the eigenvectors of L.Input: xk , size n × fk−1Without spatial subsampling:
xk+1,j = h(U
fk−1∑i=1
Fk,i ,jUT xk,i )
Fk,i ,j is a diagonal weight matrix.
Only use top d eigenvectors to reduce cost.
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 17 / 28
Experiment 1: Subsampled MNIST
Subsample MNIST to 400 points
Baseline: Nearest Neighbor (4.11% Error rate)
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 18 / 28
Experiment 1: Subsampled MNIST
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 19 / 28
Experiment 2: Sphere MNIST
Project MNIST to sphere
Uniformly or Randomly
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 20 / 28
Experiment 2: Sphere MNIST
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 21 / 28
Convolutional Neural Networks on Graphs with FastLocalized Spectral FilteringMichael Defferrard, Xavier Bresson, Pierre Vandergheynst
Improve previous spectral CNN
Main Contributions:
Strictly localized filtersLow computational complexityEfficient pooling methodMultiple experiment on different datatypes
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 22 / 28
Filters
graph filter
y = UTg(Λ)Ux
Where U is the eigenvector of L and Λ is the diagonal matrix of alleigenvalues of L
Naive approach is to learn g(Λ) = diag(θ) directly.
Limitations:
It’s not localizedThe complexity is O(n).
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 23 / 28
Polynomial filter
Polynomial filter
g(Λ) =∑L
k=1 θkΛK
Spectral filters represented by K th-order polynomials of the Laplacianare K -localized: connect all the vertices in at most K steps.
Learning complicity is O(K )
Use Chebyshev polynomial to make it faster: g(Λ) =∑L
k=1 θkTk(Λ),where Tk = 2xTk−1 − Tk−2
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 24 / 28
Pooling
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 25 / 28
Experiment 1: MNIST
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 26 / 28
Experiment 2: 20Newsgroup
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 27 / 28
Reference
1 Laplacian Operator - Wikipedia
2 An introduction to spectral graph theory Jiang Jiaqi
3 Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering MichaelDefferrard, Xavier Bresson, Pierre Vandergheynst(EPFL, Lausanne, Switzerland)
4 Spectral Networks and Locally Connected Networks on Graphs Joan Bruna, WojciechZaremba, Arthur Szlam, Yann LeCun
5 Graph signal processing: Concepts, tools and applications Xiaowen Dong
(University of Virginia) Spectral Graph Theory and Graph CNN Presenter : Ji Gao 28 / 28