Partitional Algorithms to Detect Complex Clusters


Partitional Algorithms to Detect Complex Clusters

• Kernel K-means: k-means applied in kernel space

• Spectral clustering: clustering in an eigen-subspace of the affinity matrix (kernel matrix)

• Non-negative Matrix Factorization (NMF): decompose the pattern matrix (n × d) into two matrices, a membership matrix (n × K) and a weight matrix (K × d) (sketched in code below)
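
As a concrete illustration of the NMF view of clustering, here is a minimal scikit-learn sketch. The dataset, the non-negativity shift, and the argmax assignment rule are illustrative assumptions, not the specific algorithm in the slides:

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.decomposition import NMF

# Build a non-negative n x d pattern matrix (NMF requires X >= 0,
# so we shift the synthetic data into the non-negative orthant).
X, _ = make_blobs(n_samples=200, n_features=4, centers=3, random_state=0)
X = X - X.min()

# Decompose X ~ W @ H: W is the n x K "membership" matrix,
# H is the K x d "weight" matrix.
model = NMF(n_components=3, init='nndsvd', max_iter=500, random_state=0)
W = model.fit_transform(X)
H = model.components_

# Read cluster labels off the dominant component of each row of W.
labels = W.argmax(axis=1)
```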

Kernel K-Means

Radha Chitta

April 16, 2013

When does K-means work?

• K-means works well when clusters are “linearly separable”
• Clusters are compact and well separated

When does K-means not work?

• When clusters are not linearly separable

• When the data contains arbitrarily shaped clusters of different densities

The Kernel Trick Revisited

Map points to a feature space using a basis function: $x \mapsto \varphi(x)$

Replace the dot product $\varphi(x) \cdot \varphi(y)$ with the kernel entry $K(x, y)$

Mercer’s condition: To expand a kernel function $K(x, y)$ into a dot product, i.e., $K(x, y) = \varphi(x) \cdot \varphi(y)$, $K(x, y)$ has to be a positive semi-definite function; that is, for any function $f(x)$ whose $\int f(x)^2\, dx$ is finite, the following inequality holds:

$$\iint f(x)\, K(x, y)\, f(y)\, dx\, dy \geq 0$$
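
A quick numerical sanity check of this condition (an illustration, not a proof): a Mercer kernel evaluated on any finite sample yields a positive semi-definite Gram matrix, so all of its eigenvalues are non-negative up to floating-point error. A sketch using the RBF kernel:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))

# Gram matrix of the RBF kernel K(x, y) = exp(-||x - y||^2 / 2),
# a standard Mercer kernel.
sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
K = np.exp(-sq_dists / 2.0)

# All eigenvalues should be >= 0 (up to ~1e-12 numerical noise).
print(np.linalg.eigvalsh(K).min())
```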

Kernel k-means

Minimize the sum of squared errors.

k-means:

$$\min \sum_{i=1}^{n} \sum_{j=1}^{m} u_{ij} \left\| x_i - c_j \right\|^2, \qquad u_{ij} \in \{0, 1\}, \quad \sum_{j=1}^{m} u_{ij} = 1$$

Kernel k-means: replace $x_i$ with $\varphi(x_i)$:

$$\min \sum_{i=1}^{n} \sum_{j=1}^{m} u_{ij} \left\| \varphi(x_i) - \tilde{c}_j \right\|^2$$

Kernel k-means

Cluster centers:

$$\tilde{c}_j = \frac{1}{n_j} \sum_{i=1}^{n} u_{ij}\, \varphi(x_i)$$

Substitute for the centers:

$$\sum_{i=1}^{n} \sum_{j=1}^{m} u_{ij} \left\| \varphi(x_i) - \tilde{c}_j \right\|^2 = \sum_{i=1}^{n} \sum_{j=1}^{m} u_{ij} \left\| \varphi(x_i) - \frac{1}{n_j} \sum_{l=1}^{n} u_{lj}\, \varphi(x_l) \right\|^2$$
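
Expanding the squared norm shows that the objective depends on the data only through the inner products $\varphi(x_i) \cdot \varphi(x_l) = K_{il}$:

$$\left\| \varphi(x_i) - \tilde{c}_j \right\|^2 = K_{ii} - \frac{2}{n_j} \sum_{l=1}^{n} u_{lj} K_{il} + \frac{1}{n_j^2} \sum_{l, l'=1}^{n} u_{lj}\, u_{l'j}\, K_{ll'}$$

A minimal numpy sketch of this computation (the function name and interface are illustrative):

```python
import numpy as np

def kernel_distances(K, U):
    """Squared kernel-space distances ||phi(x_i) - c_j||^2, computed from
    the n x n kernel matrix K and the binary n x m membership matrix U,
    without ever forming phi explicitly. Assumes no cluster is empty."""
    n_j = U.sum(axis=0)                          # cluster sizes
    cross = (K @ U) / n_j                        # (1/n_j) sum_l u_lj K_il
    within = np.diag(U.T @ K @ U) / n_j**2       # (1/n_j^2) sum_{l,l'} u_lj u_l'j K_ll'
    return np.diag(K)[:, None] - 2 * cross + within[None, :]
```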

Kernel k-means

• Use the kernel trick:

$$\sum_{i=1}^{n} \sum_{j=1}^{m} u_{ij} \left\| \varphi(x_i) - \tilde{c}_j \right\|^2 = \mathrm{trace}(K) - \mathrm{trace}(\tilde{U}^{\top} K \tilde{U})$$

• Optimization problem:

$$\min \left[ \mathrm{trace}(K) - \mathrm{trace}(\tilde{U}^{\top} K \tilde{U}) \right] \;\equiv\; \max\, \mathrm{trace}(\tilde{U}^{\top} K \tilde{U})$$

• $K$ is the $n \times n$ kernel matrix; $\tilde{U}$ is the normalized cluster membership matrix
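
In practice this objective can be minimized by Lloyd-style alternation: compute kernel-space distances, reassign points, repeat. A minimal sketch, where the initialization, empty-cluster guard, and stopping rule are illustrative choices rather than the method prescribed in the slides:

```python
import numpy as np

def kernel_kmeans(K, m, n_iter=100, seed=0):
    """Cluster n points into m clusters given only their n x n kernel
    matrix K. Returns an array of n cluster labels."""
    rng = np.random.default_rng(seed)
    labels = rng.integers(m, size=K.shape[0])      # random initial assignment
    for _ in range(n_iter):
        U = np.eye(m)[labels]                      # one-hot n x m membership
        n_j = np.maximum(U.sum(axis=0), 1)         # guard against empty clusters
        # squared distances to the implicit centers, from K alone
        dist = (np.diag(K)[:, None]
                - 2 * (K @ U) / n_j
                + np.diag(U.T @ K @ U)[None, :] / n_j**2)
        new_labels = dist.argmin(axis=1)
        if np.array_equal(new_labels, labels):     # converged
            break
        labels = new_labels
    return labels
```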

Example (k = 2)

[Figure: data with two concentric circular clusters in the $(x_1, x_2)$ plane; k-means fails to separate them.]

Example

Polynomial kernel: $K(x, y) = (x^{\top} y)^2$, corresponding to the feature map $\varphi(x_1, x_2) = \left( x_1^2,\; \sqrt{2}\, x_1 x_2,\; x_2^2 \right)$

[Figure: the same data clustered by kernel k-means; the two circular clusters are recovered.]

k-means vs. Kernel k-means

[Figure: side-by-side comparison of k-means (left) and kernel k-means (right).]
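
As a usage example, the kernel_kmeans sketch above can be run on data like the circular example; the dataset parameters here are illustrative:

```python
import numpy as np
from sklearn.datasets import make_circles

# Two concentric circular clusters that plain k-means cannot separate.
X, y_true = make_circles(n_samples=300, factor=0.4, noise=0.05, random_state=0)

# Polynomial kernel K(x, y) = (x'y)^2 from the slide above; in the induced
# feature space the two rings become linearly separable.
K = (X @ X.T) ** 2
labels = kernel_kmeans(K, m=2)   # kernel_kmeans as sketched earlier
```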

Performance of Kernel K-means

Evaluation of the performance of clustering algorithms in kernel-induced feature space, Pattern Recognition, 2005

Limitations of Kernel K-means

• More complex than k-means
• Need to compute and store the n × n kernel matrix
• What is the largest n that can be handled?
• Intel Xeon E7-8837 processor (Q2 2011), octa-core, 2.8 GHz, 4 TB max memory
• < 1 million points with single-precision numbers
• May take several days to compute the kernel matrix alone

• Use distributed and approximate versions of kernel k-means to handle large datasets (one such approximation is sketched below)
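
The slides do not name a specific approximation; one widely used option is the Nyström method, which builds a low-rank approximation of the kernel matrix from a random subset of landmark points, so that only an n × m block is ever computed and ordinary k-means can then run on the approximate feature map. A rough sketch, with illustrative parameter values:

```python
import numpy as np

def nystrom_features(X, n_landmarks=500, gamma=1.0, seed=0):
    """Approximate RBF-kernel features Z with Z @ Z.T ~ K, built from
    n_landmarks randomly sampled points. Run plain k-means on the rows
    of Z to approximate kernel k-means."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(X), size=n_landmarks, replace=False)
    L = X[idx]

    def rbf(A, B):
        d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
        return np.exp(-gamma * d2)

    K_nm = rbf(X, L)                 # n x m block of the kernel matrix
    K_mm = rbf(L, L)                 # m x m landmark kernel matrix
    w, V = np.linalg.eigh(K_mm)      # K_mm = V diag(w) V^T
    w = np.maximum(w, 1e-12)         # clip for numerical stability
    return K_nm @ V / np.sqrt(w)     # rows of Z satisfy Z Z^T ~ K_nm K_mm^{-1} K_mn
```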

Spectral Clustering

Serhat Bucak

April 16, 2013

Graph Notation

Hein & Luxburg

Clustering using graph cuts

• Clustering: within-cluster similarity high, between-cluster similarity low
• Minimize the cut: $\mathrm{cut}(A, \bar{A}) = \sum_{i \in A,\, j \in \bar{A}} w_{ij}$
• Balanced cuts: $\mathrm{RatioCut}(A, \bar{A}) = \mathrm{cut}(A, \bar{A}) \left( \tfrac{1}{|A|} + \tfrac{1}{|\bar{A}|} \right)$ and $\mathrm{Ncut}(A, \bar{A}) = \mathrm{cut}(A, \bar{A}) \left( \tfrac{1}{\mathrm{vol}(A)} + \tfrac{1}{\mathrm{vol}(\bar{A})} \right)$
• Mincut can be solved efficiently; RatioCut and Ncut are NP-hard
• Spectral clustering: a relaxation of RatioCut and Ncut (see the sketch after this list)
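
To make the objectives concrete, a small numpy sketch that evaluates these cut measures for a two-way partition, given the weighted adjacency matrix W and a boolean mask selecting the set A (function names are illustrative):

```python
import numpy as np

def cut(W, mask):
    """cut(A, A-bar): total weight of edges crossing the partition."""
    return W[mask][:, ~mask].sum()

def ratio_cut(W, mask):
    """Balances the cut by the number of vertices on each side."""
    c = cut(W, mask)
    return c / mask.sum() + c / (~mask).sum()

def ncut(W, mask):
    """Balances the cut by the volume (sum of degrees) of each side."""
    c = cut(W, mask)
    deg = W.sum(axis=1)
    return c / deg[mask].sum() + c / deg[~mask].sum()
```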

Framework

1. Create an affinity matrix A from the data
2. Construct the graph Laplacian L of A
3. Solve the eigenvalue problem: $Lv = \lambda v$
4. Pick the k eigenvectors that correspond to the k smallest eigenvalues
5. Construct a projection matrix P using these k eigenvectors
6. Project the data: $P^{\top} L P$
7. Perform clustering (e.g., k-means) in the new space
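
A minimal end-to-end sketch of this framework in numpy/scikit-learn. The Gaussian affinity, the unnormalized Laplacian, and k-means on the eigenvector embedding are illustrative choices; many variants exist:

```python
import numpy as np
from scipy.spatial.distance import cdist
from sklearn.cluster import KMeans

def spectral_clustering(X, k, sigma=1.0):
    # 1. affinity matrix (fully connected graph, Gaussian similarity)
    A = np.exp(-cdist(X, X, 'sqeuclidean') / (2 * sigma**2))
    np.fill_diagonal(A, 0)
    # 2. unnormalized graph Laplacian L = D - A
    L = np.diag(A.sum(axis=1)) - A
    # 3-5. eigenvectors of the k smallest eigenvalues form the projection P
    _, eigvecs = np.linalg.eigh(L)     # eigh returns ascending eigenvalues
    P = eigvecs[:, :k]
    # 6-7. cluster the rows of the embedding with k-means
    return KMeans(n_clusters=k, n_init=10).fit_predict(P)
```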

Affinity (Similarity) Matrix

Some examples:

1. The ε-neighborhood graph: Connect all points whose pairwise distances are smaller than ε

2. The k-nearest-neighbor graph: connect vertex $v_m$ to $v_n$ if $v_m$ is one of the k nearest neighbors of $v_n$

3. The fully connected graph: connect all points with each other with a positive (and symmetric) similarity score, e.g., the Gaussian similarity function $s(x_i, x_j) = \exp\left( -\left\| x_i - x_j \right\|^2 / (2\sigma^2) \right)$ (all three constructions are sketched in code below)

[Image: example affinity matrix, http://charlesmartin14.files.wordpress.com/2012/10/mat1.png]
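
The three constructions above, as a brief numpy sketch (parameter values are illustrative):

```python
import numpy as np
from scipy.spatial.distance import cdist

def affinity_graphs(X, eps=0.5, k=5, sigma=1.0):
    D = cdist(X, X)                                # pairwise Euclidean distances
    # 1. epsilon-neighborhood graph
    eps_graph = (D < eps).astype(float)
    np.fill_diagonal(eps_graph, 0)
    # 2. k-nearest-neighbor graph (symmetrized)
    knn = np.zeros_like(D)
    nearest = np.argsort(D, axis=1)[:, 1:k + 1]    # skip self at distance 0
    for i, nbrs in enumerate(nearest):
        knn[i, nbrs] = 1
    knn = np.maximum(knn, knn.T)
    # 3. fully connected graph with Gaussian similarity
    full = np.exp(-D**2 / (2 * sigma**2))
    return eps_graph, knn, full
```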

Affinity Graph

Laplacian Matrix

• Matrix representation of a graph
• D is a normalization factor for the affinity matrix A
• Different Laplacians are available
• The most important application of the Laplacian is spectral clustering, which corresponds to a computationally tractable solution to the graph partitioning problem

Laplacian Matrix

• For a good clustering, we expect the Laplacian matrix to be (approximately) block diagonal, one block per cluster, as illustrated below

http://charlesmartin14.wordpress.com/2012/10/09/spectral-clustering/
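
A quick numerical illustration of this point, on a synthetic adjacency matrix (illustrative values): for a graph whose affinity matrix has two disconnected blocks, the Laplacian L = D − A is block diagonal and the eigenvalue 0 appears once per connected component, which is why the smallest eigenvectors expose the clusters:

```python
import numpy as np

# Two fully connected 3-node components, no edges between them.
A = np.zeros((6, 6))
A[:3, :3] = 1.0
A[3:, 3:] = 1.0
np.fill_diagonal(A, 0)

L = np.diag(A.sum(axis=1)) - A             # block-diagonal Laplacian
print(np.round(np.linalg.eigvalsh(L), 6))  # two zero eigenvalues -> two clusters
```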

Some examples (vs. k-means)

[Figure: spectral clustering (left) vs. k-means clustering (right)]

Ng et al., NIPS 2001

Some examples (vs. connected components)

[Figure: spectral clustering (left) vs. connected components, single-link (right)]

Ng et al., NIPS 2001

Clustering Quality and Affinity matrix

[Image: http://charlesmartin14.files.wordpress.com/2012/10/mat1.png]

Plot of the eigenvector with the second smallest eigenvalue

DEMO

Application: Social Networks

• Corporate email communication (Adamic and Adar, 2005)

Hein & Luxburg

Application: Image Segmentation

Hein & Luxburg


Laplacian Matrix

• Given a graph G with n vertices, its n × n Laplacian matrix L is defined as L = D − A
• L is the difference of the degree matrix D and the adjacency matrix A of the graph
• Spectral graph theory studies the properties of graphs via the eigenvalues and eigenvectors of their associated graph matrices: the adjacency matrix, and the graph Laplacian and its variants
• The most important application of the Laplacian is spectral clustering, which corresponds to a computationally tractable solution to the graph partitioning problem