Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84...

51
Semantic Change in the Time of Machine Learning: doing it right! Haim Dubossarsky 1st International Workshop on Computational Approaches to Historical Language Change Florence, August 2019 [email protected]

Transcript of Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84...

Page 1: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Semantic Change in the

Time of Machine Learning:

doing it right! Haim Dubossarsky

1st International Workshop on Computational Approaches to Historical Language Change

Florence, August 2019 [email protected]

Page 2: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Congratulations!

Historical distributional semantics

Page 3: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Outline

• Problem breakdown

• Working with faulty models

• Case I: Laws of semantic change

• Case II: Comparing models’ quality

• Conclusions

Page 4: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

it’s everywhere,

it’s effects can be felt,

but you cannot see or touch it

-> meaning is the dark matter of language

Problem breakdown

Slide, courtesy of Prof. Dirk Geeraerts

Page 5: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

it’s everywhere,

it’s effects can be felt,

but you cannot see or touch it

Meaning

change -> meaning is the dark matter of language

Problem breakdown

Page 6: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Solving this conundrum Problem breakdown

Page 7: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

The distributional hypothesis

You shall know a word by the company it keeps (Firth, J. R. 1957:11)

Words occurring in similar contexts tend to have similar meanings (Z. Harris, 1954)

Problem breakdown

Page 8: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Problem breakdown

* Not a survey

Word embeddings*

• Could be sparse vectors (counts, PPMI, RI)

• Or dense vectors (word2vec , FastText, Glove)

• Or yet contextual embedding (ELMo, Bert)

All define meaning as usage statistics.

𝑤𝑖 = 𝑏𝑟𝑜𝑎𝑑𝑐𝑎𝑠𝑡

? ? ?

… … … … … … … …

? ? ?

𝑑

𝑤𝑖 = 𝑏𝑟𝑜𝑎𝑑𝑐𝑎𝑠𝑡

𝑤𝑗 = 𝑛𝑒𝑤𝑠 𝑤𝑘 = 𝑟𝑒𝑝𝑜𝑟𝑡𝑒𝑟 𝑤𝑚 = 𝑐𝑒𝑖𝑙𝑖𝑛𝑔

94 56 60 0

𝑤𝑙 = 𝑑𝑜

𝑉

Page 9: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Embeddings capture meaning

Word 1 Word 2 Human Embeddin

horse car 5.9 0.79

book paper 7.46 0.85

computer keyboard 7.62 0.79

train car 6.31 0.5

television radio 6.77 0.73

drug abuse 6.85 0.45

bread butter 6.19 0.65

cucumber potato 5.92 0.75

doctor nurse 7 0.84

smart stupid 5.81 0.6

stock market 8.08 0.97

r=.72

Problem breakdown

𝑐𝑜𝑠𝑖𝑛𝑒 𝑠𝑖𝑚𝑖𝑙𝑎𝑟𝑖𝑡𝑦 𝑤1, 𝑤2 =𝑤1 ∙ 𝑤2

𝑤1 ∙ 𝑤2 ℳ

But how did we come up with that conclusion?

Page 10: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Embeddings capture meaning

Word 1 Word 2 Human Embeddin

horse car 5.9 0.79

book paper 7.46 0.85

computer keyboard 7.62 0.79

train car 6.31 0.5

television radio 6.77 0.73

drug abuse 6.85 0.45

bread butter 6.19 0.65

cucumber potato 5.92 0.75

doctor nurse 7 0.84

smart stupid 5.81 0.6

stock market 8.08 0.97

r=.72

Problem breakdown

𝑐𝑜𝑠𝑖𝑛𝑒 𝑠𝑖𝑚𝑖𝑙𝑎𝑟𝑖𝑡𝑦 𝑤1, 𝑤2 =𝑤1 ∙ 𝑤2

𝑤1 ∙ 𝑤2 ℳ

Vectors capture semantic meaning

Vectors capture only semantic meaning

But how did we come up with that conclusion?

Page 11: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Embeddings capture meaning

Problem breakdown

𝑐𝑜𝑠𝑖𝑛𝑒 𝑠𝑖𝑚𝑖𝑙𝑎𝑟𝑖𝑡𝑦 𝑤1, 𝑤2 =𝑤1 ∙ 𝑤2

𝑤1 ∙ 𝑤2 ℳ

Vectors capture semantic meaning

Vectors capture only semantic meaning

But how did we come up with that conclusion?

What is noise?

A confound.

Any unwanted variable that influence our

metric.

Page 12: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Problem breakdown

Page 13: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Change to a word’s embeddings between two

time points [word relative to itself]

Semantic change definition

∆𝑤𝑡0→𝑡1= 𝑐𝑜𝑠𝐷𝑖𝑠𝑡 𝑤𝑡0 , 𝑤𝑡1 = 1 −

𝑤𝑡0 ∙ 𝑤𝑡1

𝑤𝑡0 ∙ 𝑤𝑡1

Problem breakdown

ℳ2

Page 14: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Semantic change validated? Word 1 Word 2 Human Embedding

horse car 5.9 0.79

book paper 7.46 0.85

computer keyboard 7.62 0.79

train car 6.31 0.5

television radio 6.77 0.73

drug abuse 6.85 0.45

bread butter 6.19 0.65

cucumber potato 5.92 0.75

doctor nurse 7 0.84

smart stupid 5.81 0.6

stock market 8.08 0.97

𝑐𝑜𝑠𝑖𝑛𝑒 𝑠𝑖𝑚𝑖𝑙𝑎𝑟𝑖𝑡𝑦 𝑤1, 𝑤2 =𝑤1 ∙ 𝑤2

𝑤1 ∙ 𝑤2

Problem breakdown

Page 15: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Semantic change validated? Word 1 Word 2 Human Embedding

horse car 5.9 0.79

book paper 7.46 0.85

computer keyboard 7.62 0.79

train car 6.31 0.5

television radio 6.77 0.73

drug abuse 6.85 0.45

bread butter 6.19 0.65

cucumber potato 5.92 0.75

doctor nurse 7 0.84

smart stupid 5.81 0.6

stock market 8.08 0.97

𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟗𝟐𝟎 𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟖𝟖𝟎

𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟗𝟔𝟎

𝑐𝑜𝑠𝑖𝑛𝑒 𝑠𝑖𝑚𝑖𝑙𝑎𝑟𝑖𝑡𝑦 𝑤𝑡1, 𝑤𝑡2 =𝑤𝑡1 ∙ 𝑤𝑡2

𝑤𝑡1 ∙ 𝑤𝑡2

Problem breakdown

Page 16: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Semantic change validated? Word 1 Word 2 Human Embedding

horse car 5.9 0.79

book paper 7.46 0.85

computer keyboard 7.62 0.79

train car 6.31 0.5

television radio 6.77 0.73

drug abuse 6.85 0.45

bread butter 6.19 0.65

cucumber potato 5.92 0.75

doctor nurse 7 0.84

smart stupid 5.81 0.6

stock market 8.08 0.97

𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟗𝟐𝟎 𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟖𝟖𝟎

𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟗𝟔𝟎

𝑐𝑜𝑠𝑖𝑛𝑒 𝑠𝑖𝑚𝑖𝑙𝑎𝑟𝑖𝑡𝑦 𝑤𝑡1, 𝑤𝑡2 =𝑤𝑡1 ∙ 𝑤𝑡2

𝑤𝑡1 ∙ 𝑤𝑡2

ℳ2

Problem breakdown

Gulordava and Baroni (2011)

Page 17: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Semantic change validated? Word 1 Word 2 Human Embedding

horse car 5.9 0.79

book paper 7.46 0.85

computer keyboard 7.62 0.79

train car 6.31 0.5

television radio 6.77 0.73

drug abuse 6.85 0.45

bread butter 6.19 0.65

cucumber potato 5.92 0.75

doctor nurse 7 0.84

smart stupid 5.81 0.6

stock market 8.08 0.97

𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟗𝟐𝟎 𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟖𝟖𝟎

𝐛𝐫𝐨𝐚𝐝𝐜𝐚𝐬𝐭𝟏𝟗𝟔𝟎

𝑐𝑜𝑠𝑖𝑛𝑒 𝑠𝑖𝑚𝑖𝑙𝑎𝑟𝑖𝑡𝑦 𝑤𝑡1, 𝑤𝑡2 =𝑤𝑡1 ∙ 𝑤𝑡2

𝑤𝑡1 ∙ 𝑤𝑡2

ℳ2

Problem breakdown

SemEval-2020. Coming soon…

Gulordava and Baroni (2011)

Page 18: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Interim summary

Problem breakdown

Meaning

Noise

Meaning

Meaning Meaning

Noise

Meaning

Noise

Meaning

What we think word embeddings capture What word embeddings REALLY capture

What we think

models of semantic change capture

What

models of semantic change REALLY capture

Page 19: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

All models are wrong

1. How wrong are they?

2. Are they importantly wrong?

Depends on what do we use these models for

noise Signal

Working with faulty models

Page 20: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

When to worry about noise

Working with faulty models

1. Word embedding as some proxy for meaning

– Machine translation, chat bots…

– Detecting semantic change

From Kim et al. (2014)

Page 21: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

When to worry about noise

Working with faulty models

1. Word embedding as some proxy for meaning

– Machine translation, chat bots…

– Detecting semantic change

2. Word embedding as the object of study

– Over interpret differences in embeddings that

actually stem from noise.

– Laws of semantic change

Page 22: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

1. Association via correlation • Detecting semantic change

• Associative “laws” of semantic change

What happens with what

2. Intervention / dissociation • Not just observing, but changing.

• Manipulating variables to establish

causation.

What causes what

3. Counterfactuals • Real causation

• True laws of semantic change

What causes what

Working with faulty models

Levels of reasoning

Page 23: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Working with faulty models

Risks in associative reasoning

Ch

ange

Page 24: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Working with faulty models

Risks in associative reasoning

Ch

ange

Time

Page 25: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Working with faulty models

Risks in associative reasoning

Ch

ange

Frequency

Page 26: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Working with faulty models

Risks in associative reasoning

Ch

ange

Frequency

A C B

Page 27: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Working with faulty models

Risks in associative reasoning

Ch

ange

Frequency

A C B

Randomized Controlled Trials

Page 28: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Randomized Controlled Trials Case I

Laws of semantic change

Page 29: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

How wrong models are?

Laws of semantic change

True effect size

From Dubossarsky et al. (2017)

Noise threshold (physical boundary)

Page 30: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Equal effect sizes for the genuine historical corpus and the shuffled historical corpus (Dubossarsky et al. 2017).

Are they importantly wrong?

r = -.748, p<.001

Genuine historical corpus r = -.747, p<.001

Shuffled historical corpus

Laws of semantic change

Page 31: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

• Law of Prototypicality (Dubossarsky et. al. 2015).

“Laws” of semantic change

Laws of semantic change

Page 32: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

• Law of Prototypicality (Dubossarsky et. al. 2015).

• Law of Innovation (Polysemy, Hamilton et. al. 2016).

“Laws” of semantic change

Bank invest

river

rock

sea

money deposit

hours

liquor

drink

glass

booze

bottle

bar

Wine

Laws of semantic change

Page 33: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

• Law of Prototypicality (Dubossarsky et. al. 2015).

• Law of Innovation (Polysemy, Hamilton et. al. 2016).

• Law of Conformity (Frequency, Hamilton et. al. 2016).

“Laws” of semantic change

Laws of semantic change

Page 34: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

% E

xpla

ined v

aria

nce

Frequency Polysemy Prototypicality

Hamilton et al. (2016)

Bank invest

river

rock

sea

money deposit

hours

11%

5%

Laws of semantic change

Genuine corpus

Shuffled corpus

True effect size

Associative laws of

semantic change

Page 35: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

% E

xpla

ined v

aria

nce

Frequency Polysemy Prototypicality

Hamilton et al. (2016)

Bank invest

river

rock

sea

money deposit

hours

11%

5%

Laws of semantic change

Genuine corpus

Shuffled corpus

True effect size

Associative laws of

semantic change

All models are wrong, but some are useful.

Page 36: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Randomized Controlled Trials Case II

General framework to compare models’ noise levels and quality

Page 37: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Evaluate noise levels

Comparing models’ noise and quality

Model A

Model B Model D

Model C

Page 38: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Evaluate noise levels

Comparing models’ noise and quality

Model A

Model B Model D

Model C

How wrong are they?

Page 39: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Evaluate noise levels

Comparing models’ noise and quality

(Dubossarsky et al. 2019)

Model A

Model B Model D

Model C

Page 40: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Synthetic semantic change

1. A wedding ring A wedding ring [100%]

No bracelet!

2. A wedding ring A wedding ring [100%]

An arm bracelet An arm ring [25%]

3. A wedding ring A wedding ring [100%]

An arm bracelet An arm ring [50%]

……

4. A wedding ring A wedding ring [100%]

An arm bracelet An arm ring [100%]

Comparing models’ noise and quality

Page 41: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

1. A wedding ring A wedding ring [100%]

No bracelet!

2. A wedding ring A wedding ring [100%]

An arm bracelet An arm ring [25%]

3. A wedding ring A wedding ring [100%]

An arm bracelet An arm ring [50%]

……

4. A wedding ring A wedding ring [100%]

An arm bracelet An arm ring [100%]

Synthetic semantic change

Comparing models’ noise and quality

Synthetic change words Synthetic stable words

Page 42: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Synthetic semantic change

Comparing models’ noise and quality

Model D Model C

Model A Model B

Page 43: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Evaluate model sensitivity

Comparing models’ noise and quality

Are they importantly

wrong?

Page 44: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Evaluate model sensitivity

Naïve classifier

if 2=<peak_position=<5:

semantic_change = True

else:

semantic_change = False

Model A Model B Model C Model D

Synthetic change

Comparing models’ noise and quality

Page 45: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Evaluate model sensitivity

Model A Model B Model C Model D

Comparing models’ noise and quality

Model A

Model B

Model C

Model D

True semantic change

Page 46: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Evaluate model sensitivity

Model A Model B Model C Model D

Comparing models’ noise and quality

Model A

Model B

Model C

Model D

True semantic change

All models are wrong, but some are useful.

And some are more useful than other!

Page 47: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Test your models!

• Use randomized control tests to evaluate levels

of noise and alleviate confounds in models.

• Simulate the phenomenon you are investigating.

• Test models’ performance on simulated data.

• Not limited to word embedding!

Conclusions Final notes

Page 48: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Doing it right

Historical distributional semantics

Page 49: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

Doing it right

Historical distributional semantics

Page 51: Semantic Change in the Time of Machine Learning · cucumber potato 5.92 0.75 doctor nurse 7 0.84 smart stupid 5.81 0.6 stock market 8.08 0.97 r=.72 Problem breakdown 𝑖 𝑖 𝑖

[email protected]

Thank you!

SemEval-2020. Coming soon…