Developing Smart Cities Services through Semantic Analysis of Social Streams

download

of 105

  • date post

    22-Aug-2015
  • Category

    Technology

  • view

    742
  • download

    0

Embed Size (px)

Transcript of Developing Smart Cities Services through Semantic Analysis of Social Streams

  1. 1. Developing Smart Cities Services through Semantic Analysis of Social Streams Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops (Universit degli Studi di Bari Aldo Moro, Italy - SWAP Research Group) WDS4SC 2015 WWW 2015 Workshop on Web Data Science and Smart Cities Florence (Italy) - May 19, 2015
  2. 2. Outline Background Information Overload Social Content Analytics CrowdPulse Social Data Extraction Semantic Tagging Sentiment Analysis Processing & Visualization Use Cases LAquila Social Urban Network The Italian Hate Map Conclusions 2Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  3. 3. Background 3Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  4. 4. Background 4 Information Overload Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  5. 5. Information Overload 5 in digital life Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  6. 6. Information Overload 6 in real life Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  7. 7. 7 Obstacleor Opportunity? Information Overload Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  8. 8. Background (again) 8Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  9. 9. 9 Social Networks can be considered as novel data silos Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  10. 10. 10 Social Networks information about preferences Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  11. 11. 11 Social Networks information about connections Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  12. 12. 12 Social Networks information about people feelings Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  13. 13. 13 Social Networks changed the rule for content analytics Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  14. 14. 14 Social Content Analytics Successful Use Cases - Online brand monitoring - Social CRM - Real-time polls All these applications share a common insight Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  15. 15. 15 Social Content Analytics Research Question Is it possible to aggregate rough human-generated data to get complex people-based ndings? Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  16. 16. 16 Our contribution: CrowdPulse A framework for real-time Semantic Analysis of Social Streams Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  17. 17. 17 CrowdPulse Social Data Extraction Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015 features Semantic Tagging Sentiment Analysis Processing & Visualization
  18. 18. 18Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015 workow CrowdPulse
  19. 19. 19Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015 Step 1: Social Data Extraction CrowdPulse
  20. 20. 20Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015 Step 1: Social Data Extraction Extraction Source Heuristics CrowdPulse
  21. 21. 21Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015 Step 1: Social Data Extraction Extraction Source Heuristics CrowdPulse
  22. 22. 22Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015 Step 1: Social Data Extraction Extraction Source Heuristics Content User Geo Content+Geo #www2015 #democrats #trafc @barack_obama @comune #earthquake Page Group CrowdPulse
  23. 23. 23Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015 Step 1: Social Data Extraction Extraction Source Heuristics Content User Geo Content+Geo #www2015 #democrats #trafc @barack_obama @comune #earthquake Page Group We only extract public content CrowdPulse
  24. 24. 24Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015 Step 2: Semantic Tagging CrowdPulse
  25. 25. 25 aquila ? ? (eagle) (italian city) (italian) Semantic Tagging Motivations Poor Semantics Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  26. 26. Keyword-based representation introduces a lot of noise in the analysis 26 aquila ? ? (eagle) (italian city) (italian) Semantic Tagging Motivations Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  27. 27. (Please, do something: lAquila is going to die!) (Please, do something: the eagle is going to die!) Fate qualcosa per favore, lAquila sta morendo! ? 27 Semantic Tagging Motivations Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  28. 28. Entity Linking Algorithms Input: textual content Output: identication and disambiguation of the entities mentioned in the text. (1) http://tagme.di.unipi.it (2) http://spotlight.dbpedia.org 28 Step 2: Semantic Tagging Solution: semantic processing of extracted content Algorithms CrowdPulse Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  29. 29. 29 Step 2: Semantic Tagging CrowdPulse Entity Linking: identication and disambiguation of the entities mentioned in the text. Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  30. 30. 30 Step 2: Semantic Tagging CrowdPulse Non-trivial NLP tasks (stopwords removal, n-grams identication, named entities recognition and disambiguation) are automatically performed Entity Linking: identication and disambiguation of the entities mentioned in the text. Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  31. 31. CrowdPulse 31 Step 2: Semantic Tagging Entity Linking: identication and disambiguation of the entities mentioned in the text. Each entity is a reference to a Wikipedia page http://it.wikipedia.org/wiki/Massimo_Cialente IMPORTANT! Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic Analysis of Social Streams. WDS4SC 2015 Workshop, Florence (Italy) 19.05.2015
  32. 32. We enriched the entity-based representation by exploiting the Wikipedia categories tree 32Cataldo Musto, Giovanni Semeraro, Marco de Gemmis, Pasquale Lops Developing Smart Cities Services through Semantic