Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection...

25
1 Studying Multicultural Diversity of Cities through Social Media Language Detection Michela Arnaboldi, Marco Brambilla , Beatrice Cassottana, Paolo Ciuccarelli, Davide Ripamonti, Simone Vantini, Riccardo Volonterio.

Transcript of Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection...

Page 1: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

1

Studying Multicultural Diversity of Cities through Social Media Language Detection

Michela Arnaboldi, Marco Brambilla, Beatrice Cassottana, Paolo Ciuccarelli, Davide Ripamonti, Simone Vantini, Riccardo Volonterio.

Page 2: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

The digital reflection of the

city

Page 3: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

The Urban Macroscope

Joël de Rosnay, The Macroscope, 1979

Page 4: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

4

1. City and the world

The first lens:

Which parts of the world do talk to Milan and which parts does Milan talk to?

Mobile-phone calls from Milan and toward Milan.

Page 5: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

1. City and the world: the countries that talk to Milan (1)

5

Page 6: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

1. City and the world: the countries that talk to Milan (2)

6

EU + CH

Page 7: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

1. City and the world: the countries that talk with Milan (7)

7

Page 8: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

2. Cities into cities: a city made of different cities

8

The three digital cities in Milan, based on Twitter languages.

The three cities of Twitter are: • A city that speaks in Italian with itself and with the rest of

Italy;• An international city that speaks in English with the rest of

the world;• A multi-ethnic city projected toward the new urban

communities and the communities of origin.

Page 9: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

9

2. Cities into citiesIn red the most «Italian» NILs, in blue the most «global», in yellow the most «multiethnic»

Page 10: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

10

2. Cities into cities In red the most «Italian» NILs, in blue the most «global», in yellow the most «multiethnic»

Among the NIL with a statistically significant number of tweets in the considered quarter, the coloured areas (25% of total areas) represent the NIL with the highest shares of messages.

Page 11: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

2. Cities into cities

Which languages other than Italian and English characterize Milan?

11

Page 12: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

2. Cities into cities

Spanish 25%

12

Heterogeneity in the distribution languages different from Italian and English (NIL characterized by a high linguistic entropy).

Which languages other than Italian and English characterize Milan?

Page 13: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

2. Cities into cities

Spanish 57%

Indonesian 36%

13

NIL characterized by an average linguistic entropy.

Which languages other than Italian and English characterize Milan?

Page 14: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

2. Cities into cities

Arabic 89%

14

There is a predominant language among the ones different from Italian and English (NIL characterized by a low linguistic entropy).

Which languages other than Italian and English characterize Milan?

Page 15: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

Some numbers from Twitter

NIL % of predominant language

Trenno 72% ArabicLoreto, Umbria – Molise, Ortomercato

64% Arabic

Parco Forlanini - Ortica 63% ArabicQuintosole 100% SpanishParco dei Navigli 69% SpanishGallaratese 66% SpanishCascina Triulza- Expo 57% SpanishEx Om – Morivione 56% SpanishMecenate 51% SpanishPadova 59% TagalogGiambellino 57% TagalogVillapizzone 56% TagalogBruzzano 59% PortugueseParco Nord 55% DutchChiaravalle 75% Norwegian

Page 16: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

Some official numbers from Census Data

Page 17: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

Correlation: Twitter vs. official residents

• Tagalog, Ukranian and Romanian overexposed

• Portuguese, Dutch, Norwegian, Albanian underexposed

• Arabic, Spanish largely present and slightly overexposed

Page 18: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

18

3. City Magnets

The third lens:

Which are the main attractions of the city?

• the most popular places and the most “checked-in” locations by the users of Foursquare (Swarm).

Page 19: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

19

3. City Magnets and EXPO

• The Map during Expo. Expo as a new attraction, which does not delete the “traditional” ones

Page 20: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

20

4. Top Venues

The fourth lens:

Which are the most «cool» places of Milan?

• The analysis of the checkins according to the type of «venue» allows to see where the people want to be «seen».

Page 21: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

21

4. Top Venues: The appearance Expo (1)

• The columns represent the places with the highest numbers of checkins in April, May and June.

Page 22: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

22

4. Top Venues: The Hotels (2)

• The places where the people in the city want to be seen.(the dynamic of the hotels between Expo and the Furniture Fair)

Page 23: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

23

4. Top Venues: The Hotel (5)

• The places where the people in the city want to be seen(the dynamic of the hotels between Expo and the Furniture Fair)

Page 24: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

Further work (lenses)

• Photo analysis– Flickr vs. Instagram

• Frequent paths – Mobile phone data vs. sensors

• Events coverage

Page 25: Studying Multicultural Diversity of Cities and Neighborhoods through Social Media Language Detection  - CityLab workshop on smartcity at ICWSM

25

Questions?

Contact: [email protected]@marcobrambi