Curatir

9
CURATIR: AN ART EXPLORER FOR NYC WEI (THOMAS) WANG COLUMBIA UNIVERSITY

Transcript of Curatir

CURATIR: AN ARTEXPLORER FOR NYC

WEI (THOMAS) WANGCOLUMBIA UNIVERSITY

FINDING ARTS IN NEW YORK

How many Picassos are in New York?

DEMO

Go to http://curatir.weiwang.io

THE STACK

Data Pipeline

THEMATIC ANALYSIS OF PAINTERSTokenize->Stem->Tag->TFiDF

Latent Semantic Indexing(SVD) is used to linearlyproject the term frequencydistribution to a lowerdimensional (50) space.

Cosine similarities betweenartists are calculated.

while trying to stay active

ABOUT MEStat PhD at Columbia and work on Bayesian modeling and causal inference

ARTISTS' THEMES

Themes are extracted from TFiDF-adjusted frequencydistribution of adjectives (found via Part-of-Speechtagging).To associate artists with a particular theme, the theme istreated as a one word document and cosine similaritybetwen the document and artist is calculated.

SAMPLE OUTPUTS