Post on 17-Feb-2017
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Timo Honkela
Modeling Meaning and Knowledge25 Apr 2016
timo.honkela@helsinki.fi
An introduction totext mining
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Data mining
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Data mining tasks(Hand, Mannila & Smyth 2001)
● Exploratory data analysis● Descriptive modeling● Prescriptive modeling:
classification and regression● Discovering patterns and rules● Retrieval by content
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Text mining
http://www.intechopen.com/books/theory-and-applications-for-advanced-text-mining
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Text mining
● Finding structures and relations at different levels of abstraction
● Study of distributions, trends and correlations● Text classification and clustering● Entity extraction● Authorship analysis● Sentiment analysis● etc. etc.
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Application areas of text mining
● Digital humanities– Sociology
– History
– Literature
– Law
● Knowledge management● Customer relationship management (CRM)● Competence management
– Archeology
– Linguistics
– Religion
– Philosophy
● Remember also– Medicine
– Psychology
– Geology
– etc.
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
Examples using the SOM
● Art museum visitorsPockets full of memories: an interactive museum installationG Legrady, T HonkelaVisual Communication 1 (2), 163-169
● PoetryIn search for volta: Statistical analysis of word patterns in Shakespeare's sonnetsO Kohonen, S Katajamäki, T Honkela.Proceedings of AMKLC'05, International Symposium on Adaptive Models of Knowledge, Language and Cognition, pages 44–47, Finland
● Religious cognitionCounterintuitiveness as the hallmark of religiosityI Pyysiäinen, M Lindeman, T HonkelaReligion 33 (4), 341-355
● CompetenceDocument maps for competence managementT Honkela, R Nordfors, R TuuliProceedings of the Symposium on Professional Practice in AI, 31-39
Dimensionality reductionVisualizationAbstraction
Timo Honkela, Modeling Meaning and Knowledge, 25.4.2016
New projects
● Digital Mindscapes: Mining social media(Jussi Pakkasvirta, Krista Lagus, Mika Pantzar, Minna Ruckenstein, etc.)
http://www.aka.fi/globalassets/32akatemiaohjelmat/digihum/citizen-mindscapes-digihum-starts_3-vain-luku.pdf
● Computational History 1640–1910: Mining newspapers(Mikko Tolonen, Kimmo Kettunen, Hannu Salmi, Tapio Salakoski, etc.)
http://www.aka.fi/globalassets/32akatemiaohjelmat/digihum/comhis-presentation-logomo-22-march-2016.pdf
In many casesa supportinginfrastructureis FIN-CLARIN