Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a...

76
Keyness Analysis for (Critical) Discourse Studies: What, How, Why Costas Gabrielatos Edge Hill University [email protected] ehu.ac.uk/gabrielatos Digital humanities beyond the basics University of Bergamo, Italy, 13 September 2018

Transcript of Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a...

Page 1: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness Analysis for (Critical) Discourse Studies:

What, How, Why

Costas GabrielatosEdge Hill University

[email protected]/gabrielatos

Digital humanities beyond the basicsUniversity of Bergamo, Italy, 13 September 2018

Page 2: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Based on

Gabrielatos (2013b, 2017, 2018)

and

Gabrielatos & Marchi (2011, 2012)

Page 3: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Focus

Part 1• Nature of keyness• Unit of analysis• Foci, types & utility

Part 2• Corpus selection

Part 3• Appropriate metrics

Part 4• Selection of key

items

Part 5• Example study

Part 6• Conclusions and

recommendations

Page 4: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Part 1

Nature of keynessUnit of analysis

Foci, types & utility

Page 5: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness Analysis: Nature

• Distinctiveness in terms of frequency.

• Comparisons of normalised frequency between two (sub-)corpora.

Page 6: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

A note on frequency

• Raw frequency

– The number of times an item is found in a corpus

• Normalised (relative) frequency

–Per 100 words (%), 1000 words, 1 mil. words etc.

• Why normalised frequencies?

They enable frequency comparisons between (sub-)corpora.

Page 7: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

The need for normalised frequencies

Raw

Corpus 1 100

Corpus 2 100

A: We can’t tell. We need to know the size ofthe two corpora.

Q: Does this item have the same frequency inthe two corpora?

Page 8: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

The need for normalised frequencies

Raw

Corpus 1 100

Corpus 2 100

Q: Does this item have the same frequency inthe two corpora?

A: No, it’s five times more frequent in Corpus 1

Corpus size Normalised

1,000,000 100 pmw

5,000,000 20 pmw

[For other normalisation options, see Gabrielatos (2013)]

Page 9: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness: Unit of analysis

• Usually word-forms– Hence, the common term ‘keyword analysis’

• However, focus can also be on any

lexicogrammatical, phonological,

semantic, pragmatic, or discoursal unit.

A better term: key item (Wilson, 2013)

Page 10: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness: possible units of analysis

• Word-forms• Word-form + POS (running_VVG v. running_JJ)• Lemmas (run, runs, running, ran)• POS tags• MWUs (in light of)• n-grams• Lexicogrammatical units (BE interested in)• Constructions (conditionals)• Semantic fields/attributes• Pragmatic functions• Discourse / conversational units

Page 11: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness Analysis: Foci

So far, predominantly difference.

• However, the focus can also be …

Absence (Duguid & Partington, 2018; Partington, 2014).

Similarity (Taylor, 2011, 2013, 2018).

Both can be seen in relation to difference (Gabrielatos, 2018):

–absence: extreme case of difference– similarity: lack of difference

Two types of keyness: Keyness-D, Keyness-S(Gabrielatos, 2018)

Page 12: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Two types of Keyness Analysis

Exploratory• Frequency comparison of all items.• Concordance analysis of the key items to establish

patterns of difference, absence, and similarity.This is how keyness analysis is overwhelmingly used

in corpus-based discourse studies.

Focused• Frequency comparisons of particular items.• Concordance analysis.Addresses particular research questions or

hypotheses.

(Gabrielatos, 2018; Gries, 2010: 285; Partington, 2009: 286)

Page 13: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness Analysis: Utility

Frequency differences/similarities

Differences / similarities in topic / attitude / stance / ideology

Page 14: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Part 2

Selecting Corpora

Page 15: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Corpus Selection: Terms

Study Corpus• The (sub-)corpus we want to investigate.

Reference Corpus• The (sub-)corpus we compare the study

corpus to.

Page 16: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Corpus Selection: Considerations (1)

The comparison must be meaningful

Any frequency differences/similarities must lend themselves

to useful interpretations.

Page 17: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Corpus Selection: Considerations (2)

Ideally, the two corpora will differ in only one respect• Medium (speech, writing, written-to-be-spoken)

• Topic• Time• Language Variety• Nativeness (L1/L2)

• Expertise/Level (e.g. professional vs. novice writers)

• Text Type• Source (e.g. newspaper A vs. newspaper B)

Page 18: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Corpus Selection: Considerations (3)

Possible comparisons

Variety A vs. Variety B

L1/L2 spoken vs. L1/L2 written

L1 spoken/written vs. L2 spoken/written

Expert vs. Novice/Learner

Topic-specific vs. General

Topic A vs. Topic B

Time-period 1 vs. Time-period 2

Page 19: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Corpus Selection: Considerations (4)

Make-up of the two corpora

• Keyness analysis using whole corpora makesthe implicit assumption that they arehomogeneous – they may not be.

• If they are not homogeneous, then keynessanalysis must be carried out between sub-corpora.– e.g. newspaper corpus: tabloid v. broadsheet,

but also left-leaning v. right-leaning.

Page 20: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Corpus Selection: Addressing Misconceptions

• The reference corpus does not need to be a general corpus.

• The distinction between study and referencecorpus is just one of focus.– We can compare two corpora (A and B) twice:

• A is the study corpus, B is the reference corpus.• B is the study corpus, A is the reference corpus.

• Study and reference corpora can be of any size.– The study corpus does not need to be smaller than

the reference corpus.– Corpus size differences are taken into account by

statistical significance tests.

Page 21: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Part 3

Appropriate Metrics

Page 22: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness Metrics: A fundamental distinction

Effect size• The size of the frequency difference.

Statistical significance• The degree of confidence that the

observed frequency difference isdependable.

Page 23: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Effect size v. Statistical significanceWhat do they measure?

• Effect size “indicates the magnitude of an observedfinding” (Rosenfeld & Penrod, 2011: 342).

• Effect size “is a measure of the practical significance of aresult, preventing us claiming a statistical significant resultthat has little consequence” (Ridge & Kudenko, 2010: 272).

• “Just because a particular test is statistically significantdoes not mean that the effect it measures is meaningful orimportant” (Andrew et al., 2011: 60).

• “A very significant result may just mean that you have alarge sample. [...] The effect size will be able to tell uswhether the difference or relationship we have found isstrong or weak.” (Mujis, 2010: 70).

Frequency difference and statistical significance are not the same

Page 24: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

“Tests of statistical significance are dependent on the samplesize used to calculate them. [...] With very large sample sizes,even very weak relationships can be significant. Conversely,with very small sample sizes, there may not be a significantrelationship between the variables even when the actualrelationship between the variables in the population is quitestrong. Therefore, different conclusions may be drawn indifferent studies because of the size of the samples, ifconclusions were drawn based only on statistical significancetesting. Unlike tests of significance, effect size estimates arenot dependent on sample size. Therefore, another advantageof using effect size estimates is that they provide informationthat permits comparisons of these relationships acrossstudies” (Rosenfeld & Penrod, 2011: 84).

Effect size v. Statistical significanceThe influence of corpus size

Page 25: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Statistical Significance: Background

• The log-likelihood test (G2 or LL) was notdeveloped for keyness analysis.– Proposed by Dunning (1993) to accurately

identify the statistical significance of rare events.

• When corpus linguistics was about to adopta stat.sig. metric to measure frequencydifferences, researchers in other fields (e.g.STEM, psychology) were vocally challengingits use as the main/only metric in theirstudies (e.g. Thompson, 1998).

Page 26: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Gabrielatos & Marchi (2011, 2012) carried out keynessanalyses using different corpora (small/large,general/specialised), and compared the rankingsresulting from sorting items according to the effect sizescore (%DIFF) and the statistical significance score (LL).

They tested the degree that the two rankingscoincided, using Spearman’s Rank Correlation (rs).• Rs = 1 indicates full correlation• Rs = 0 indicates no correlation

They found almost no correlation (see also Brown2018: 237-26.2)

Effect size v. Statistical significanceComparing rankings

Page 27: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Putative full positive correlation (rs= 1)between LL and %DIFF rankings

Page 28: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Actual correlation of rankings: SiBol 1993 v. SiBol 2005 (rs= 0.010)

Page 29: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Actual correlation of rankings: Guardian 2005: Media (1 mil.) v. Home (rs= 0.040)

Page 30: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Actual correlation of rankings: Hutton and BNC (rs= 0.094)

Page 31: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness: Appropriate Metrics

• The main metric should reflect the size of thefrequency difference (Bestgen 2017; Gabrielatos 2017, 2018;

Gabrielatos & Marchi 2011, 2012; Gries 2010; Kilgarriff 2001, 2009).

Only purely effect-size metrics are appropriate. Key items are sorted by the effect-size score.

• When the focus is difference or absence, it isuseful to check if the observed difference isreliable. Additional statistical significance testing.

• When the focus is similarity, statistical significancetesting is irrelevant.

Page 32: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness Metrics

Effect size metrics• Difference Coefficient (Hofland & Johansson 1982)

• Simple Ratio (Kilgarriff, 2009)

• %DIFF (Gabrielatos & Marchi, 2011)

• Log-ratio (Hardie, 2014)

• Odds Ratio (Pojanapunya & Watson Todd, 2016)

Statistical significance metrics• log likelihood (G2 or LL) (Rayson et al. 2004)

• Bayes Information Criterion (BIC) (Wilson 2013)

Page 33: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Simple Ratio =NFC1NFC2

Odds Ratio =RFC1 / (C1 – RFC1)RFC2 / (C2 – RFC2)

Log Ratio = logNFC1NFC2

%DIFF =(NFC1 – NFC2) * 100

NFC2

Diff Coefficient =NFC1 – NFC2NFC1 + NFC2

RF = raw frequency NF = normalised frequencyC1 = corpus 1 C2 = corpus 2

Page 34: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Statistical Significance: Nature (1)

Stat.sig. tests examine the null hypothesis (H0).• In keyness analysis, H0 is that there is no

frequency difference.• If p-value higher than threshold, H0 is rejected.

Statistical significance scores are sensitive to …• the item frequency in the two corpora;• the sizes of the two corpora.

The larger the item frequencies and/orcorpora, the smaller the differences thatwill be significant.

Page 35: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

An item can have very high effect size but(relatively) low stat.sig.The stat.sig. value does not accurately

reflect the size of a difference.

An item can have very high stat.sig. but verylow effect size.The high stat.sig. simply indicates that we

can be highly confident that there is a verysmall frequency difference.

Problems with using stat.sig. insteadof effect size to measure keyness (1)

Page 36: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Different items can have very different stat.sig.but similar effect sizes.Using stat.sig. values to measure keyness can

exclude true key items from the analysis.

Different items can have similar stat.sig. butvery different effect sizes.Using stat.sig. values to measure keyness can

result in treating a low-level key item as ahigh-level one – or a non-key item as key.

Problems with using stat.sig. insteadof effect size to measure keyness (2)

Page 37: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Statistical Significance: Nature (2)

In a keyness analysis, stat.sig. scores showthe extent to which the compared corporaare large enough and/or the item is frequentenough for an observed sizeable frequencydifference to be reliable.

It makes no sense to test for stat.sig.when the focus is similarity (i.e. whenfrequency differences are very small).

Page 38: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Confidence is not enough – and it may mislead

Page 39: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Statistical Significance: Metrics & Thresholds (1)

As corpus data cannot be expected to havenormal distribution, the chi-squared test(X2) is not appropriate, with the log-likelihood test (G2) used instead (Rayson et al.,

2004).

The p-value taken as the threshold forstat.sig. varies from study to study, rangingfrom p=0.01 to p=0.00000000001 (or evenhigher) (Pojanapunya, & Watson Todd, 2016).

Page 40: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Statistical Significance: Metrics & Thresholds (2)

Sensitivity of stat.sig. values to itemfrequencies and corpus sizes• The same p-value may represent different

degrees of reliability in different comparisons(Wilson, 2013: 7-8).

Solution• Using the Bayesian Information Criterion:

BIC ≈ LL – log(N) (Wilson, 2013: 6).

• BIC can be seen as a way to normalise p-values BIC values can be compared acrossdifferent keyness analyses.

Page 41: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Interpreting BIC Values

BIC Degree of evidence against H0

<0 No evidence – favours H0

0-2 Not worth more than a bare mention

2-6 Positive evidence against H0

6-10 Strong evidence against H0

>10 Very strong evidence against H0

Page 42: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

(Gabrielatos, 2014)

Keyness-D at a glance

Page 43: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Part 4

Selecting Key Items for Analysis

Page 44: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Selecting KIs for manual analysis: Current approaches

• Selection of top-N items.

• Setting a high item frequencythreshold.

• Setting a high stat.sig. threshold.

(Pojanapunya & Watson Todd, 2016: 3-10)

Page 45: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

‘Top-N’ technique: Problems

KI1

KI2

KI3

KI4

KI5

KI6

KI7

KI8

KI9

KI10

KI11

KI1

KI2

KI3

KI4

KI5

KI5

KI6

KI7

KI8

KI9

KI10

KI11

Blue• Implicit

assumption/expectation

Green• Usual reality

(Gabrielatos, 2017)

Page 46: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

‘Top-N’ technique: Problems

• Does not take account of the distancebetween the effect-size of consecutiveitems.

• In fact, it implicitly assumes equal distance.

• Item ‘N’ may have …– a very similar effect-size to item ‘N+1’

(excluded from analysis),–while having a large difference with item

‘N-1’ (included in analysis).

Page 47: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Setting a high item frequency threshold: Problems

• May exclude important absences.

• May exclude very large frequencydifferences by excluding items with very lowfrequency in corpus 1 but very highfrequency in corpus 2.

“It is a brave, or rather foolish, analyst whoassumes that, in any given data set, thewords are so unlikely to be key that they canbe safely ignored from the very start.”

(McEnery, 2006: 148)

Page 48: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Setting an extremely high statistical significance threshold: Problems

• May exclude very large frequencydifferences simply because they don’t haveextremely high statistical significance.– e.g. large frequency differences between mid-

freq. items

• May include small(er) frequency differencesjust because of extremely high statisticalsignificance.– e.g. very small frequency differences between

extremely high frequency items.

Page 49: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Selecting KIs: Cluster Analysis

• Before clustering, all items are treated ascandidate key items (CKIs)

• CKIs are grouped in a predetermined numberof clusters, according their respectivefrequency differences.

• KIs are selected in whole clusters, startingfrom the top one.– If an item in a cluster is selected for analysis, all

other items in that cluster must also beselected.

– Avoids cherry-picking.(For details, see Gabrielatos, 2018)

Page 50: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Clustering: approach and settings (1)

• Hierarchical cluster analysis: agglomerativemethod.

– Bottom-up: initially, each CKI treated as a cluster.

• (Dis)similarity measured via Euclidian distance

– Square root of sum of squares of pairwise differences between effect size scores.

• Distance between clusters measured via average group linkage

– Average of distances between all effect-size scores in each cluster.

Page 51: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Clustering: approach and settings (2)• Predetermined number of clusters

– Accommodates the usual restriction in the number of KIs that can be manually examined.

No of clusters =Total N of CKIs

N of CKIs to be examined

• No item frequency cut-off.– Caters for focus on absence.– Caters for large frequency differences due to very

low item frequency in one of the corpora.

• Initially, no statistical significance cut-off.– Caters for focus on similarity.

Page 52: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Clustering: approach and settings (3)Whatever the focus• Clustering according to effect-size (%DIFF).• Items in the same cluster are equally key Selection of whole clusters.

Keyness-D• Before clustering, frequency differences below set

statistical significance threshold are removed.• CKI clusters selected starting with cluster containing

the largest frequency differences.

Keyness-S• No statistical significance filtering.• CKI clusters selected starting with cluster containing

the from smallest frequency differences.

Page 53: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Clustering Approach: summary

• Primarily takes into account effect-size.– Clustering according to effect-size.

• Caters for focus on both difference(keyness-D) and similarity (keyness-S).

• When focus is on difference, stat.sig. isadded as a secondary consideration.

• Avoids pre-filtering (no frequencythresholds).– Initially, all items are regarded as

candidate key items (CKIs)

Page 54: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

But this is not the end

Identifying CKIs is not enough.• Simply looking at a list of linguistic items does not

help us identify aspects of content (e.g. topics,attitudes) …

• … although it may help us formulate initialhypotheses.

We need to examine instances of each key item inenough co-text (a.k.a concordance analysis):• Annotation of each instance – keeping in mind

the relevant context.• Quantification of results.

Page 55: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Part 5

Example Study

Page 56: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Corpora

• 2017 UK election Conservative and Labour manifestos. – CM2017: 29,954 words; LM2017: 23,691 words.

• Texts from Paul Rayson’s Wmatrix webpage (Rayson, 2009): http://ucrel.lancs.ac.uk/wmatrix/ukmanifestos2017

• Further manual cleaning to (fully) remove – page numbers– chapter/section numbers– headers and footers– characters indicating bullet points and quotation

marks

Page 57: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Tools & Metrics

Corpus tools: • WordSmith 7 (Scott, 2016)

• Paul Rayson’s effect-size+stat.sig. Excel spreadsheet (Rayson, 2009)

Cluster analysis: SPSS 22

Effect-size metric: %DIFF (Gabrielatos & Marchi, 2011)

Stat.sig metrics: G2 and BIC (threshold value: 2)

Page 58: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Absences

United Labour’s banks

Kingdom equality renters

universities unions women’s

shall LGBT failure

shale reinstate enforce

scrap extending

privatisation centres

In CM, but not in LM In LM, but not in CM

Page 59: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

universities in Conservatives Manifesto

Page 60: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

scrap in Labour Manifesto

Page 61: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Largest Differences

CKIs in CM2017Smallest %DIFF: 79.14

UNITED, KINGDOM, UNIVERSITIES, SHALL, SHALE, STABLE, DATABELIEVE, GENERATIONS, GO, ONLINE, IF, INSTITUTIONS, LEADERSHIP, TECHNICAL, OPPORTUNITY, TECHNOLOGY, DIGITAL, GREAT, STRONG, BETTER, WANT, HELP, UNION, WORLD, DO, CONTINUE, BEST, SO, CAN, WE

CKIs in LM2017Smallest %DIFF: 61.81

LABOUR'S, EQUALITY, UNIONS, LGBT, REINSTATE, SCRAP, PRIVATISATION, BANKS, RENTERS, WOMEN'S, FAILURE, ENFORCE, EXTENDING, CENTRES, LABOUR, CUTS, OFFICERS, OWNERSHIP, CRISIS, GUARANTEE, REGIONAL, ARRANGEMENTS, VITAL, STAFF, RIGHTS, WOULD, WORKERS, STANDARDS, UNDER, BACK, CONSERVATIVES, JOBS, ALL, ON

Page 62: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

The effect of changing the stat.sig. threshold

• Let’s compare the CKIs returned when BIC≥2(in this study, corresponding to p≤0.001) withthose returned with the less strict statisticalsignificance threshold of p≤0.01.

• Do we simply get more CKIs, added after thelowest ranking CKI derived with the stricterstat.sig. threshold?

• CKIs returned when BIC≥2 (p≤0.001) are marked in yellow.

Page 63: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Cluster Difference: CKIs in CM2017 (p<0.01)

1 1:UNITED

2 2:KINGDOM

3 3:UNIVERSITIES

4 4:SHALL

5 5:SHALE

6 6:YOUNGER; 7:AHEAD; 8:YOUR

7 9:EASIER; 10:MERITOCRACY

811:DESIGN; 12:MIGHT ; 13:ELDERLY; 14:COMPETITIVE; 15:DEEP; 16:ACTIVE;

17:ATTRACT; 18:PUPILS

9

19:EXCEPTIONAL; 20:THINGS; 21:LEADERS; 22:WRONG; 23:GLOBE;

24:EDINBURGH; 25:REGULATORS; 26:EXPLORE; 27:COMBAT; 28:WORRY;

29:GOVERN

10

30:STABLE; 31:DATA; 32:PROSPEROUS; 33:DIFFICULT; 34:FRAMEWORK;

35:BELIEVE; 36:MUCH; 37:GENERATIONS; 38:GO; 39:INFORMATION;

40:ONLINE; 41:IF; 42:INSTITUTIONS; 43:LEADERSHIP; 44:TECHNICAL;

45:OPPORTUNITY; 46:TECHNOLOGY; 47:OLD; 48:SIGNIFICANT; 49:POOR;

50:DIGITAL; 51:GREAT; 52:REMAIN; 53:WORLD'S; 54:STRONG;

55:PARTNERSHIP; 56:THERESA; 57:BETTER; 58:WANT; 59:MARKETS;

60:STRONGER; 61:HELP; 62:INTERESTS; 63:PROSPERITY; 64:NATION;

65:UNION; 66:GREATER; 67:NOW; 68:WORLD; 69:DO; 70:TOGETHER; 71:LEAVE;

72:SCHOOL; 73:CONTINUE; 74:BEST; 75:EUROPEAN; 76:RIGHT; 77:SHOULD;

78:ABOUT; 79:USE; 80:AROUND; 81:TAKE; 82:BRITISH; 83:SO; 84:THOSE;

85:CAN; 86:MAKE; 87:WE; 88:THIS; 89:IT; 90:BRITAIN; 91:PEOPLE; 92:IN

Page 64: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Clusters Difference: CKIs LM2017 (p<0.01)

1 1:LABOUR'S

2 2:EQUALITY

3 3:UNIONS

4 4:LGBT

5 5:REINSTATE

6 6:SCRAP

7 7:PRIVATISATION; 8:BANKS

89:RENTERS; 10:WOMEN'S; 11:FAILURE; 12:ENFORCE; 13:EXTENDING; 14:CENTRES;

15:NEGOTIATING; 16:PROBATION; 17:ADULT

9

18:PROCUREMENT; 19:INSECURE; 20:WAGES; 21:HIV; 22:TOURISM; 23:PRIORITISE;

24:REINTRODUCE; 25:PROFIT; 26:YOUTH; 27:TRANSITION; 28:REVERSE;

29:RESOLUTION; 30:NEGLECT; 31:ABOLISH; 32:PROFITS; 33:MATERNITY;

34:OPERATIVE; 35:UNLIKE; 36:LIBRARIES; 37:RECOGNITION; 38:LATE;

39:CONTROLS; 40:HANDS; 41:BALANCE; 42:MUSIC; 43:DELIVERS; 44:JUDICIAL;

45:OPTIONS; 46:FARES

10

47:LABOUR; 48:CUTS; 49:OFFICERS; 50:UN; 51:FAILED; 52:OWNERSHIP; 53:EQUAL;

54:ECONOMIES; 55:CRISIS; 56:WAR; 57:FORMS; 58:PEACE; 59:ALLOWANCE;

60:TARGETS; 61:FEES; 62:GUARANTEE; 63:REGIONAL; 64:LEGISLATION;

65:TRADING; 66:ARRANGEMENTS; 67:VITAL; 68:STAFF; 69:LED; 70:RANGE;

71:PLANS; 72:RIGHTS; 73:HOURS; 74:TOWARDS; 75:WOULD; 76:FULLY; 77:OWNED;

78:WORKERS; 79:DISABILITIES; 80:STANDARDS; 81:DISCRIMINATION; 82:FOOD;

83:UNDER; 84:BACK; 85:CLIMATE; 86:CONSULT; 87:CUT; 88:CONSERVATIVES;

89:PRIVATE; 90:JOBS; 91:ENVIRONMENTAL; 92:TRANSPORT; 93:INVEST;

94:WOMEN; 95:EMPLOYMENT; 96:SECTOR; 97:HOMES; 98:END; 99:MANY; 100:ALL;

101:FUNDING; 102:PROTECT; 103:REVIEW; 104:BEEN; 105:COMMUNITIES; 106:INTO;

107:ON

Page 65: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

The effect of changing the stat.sig. threshold

• Do we simply get more CKIs, added after thelowest-ranking CKI derived with the stricter

stat.sig. threshold? NO The ranking changes! The clustering changes!

Why? The lower stat.sig. threshold adds some items

with larger frequency differences than someof the items returned by the stricterthreshold.

Page 66: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Part 6

Conclusions & Recommendations

Page 67: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Conclusions (1)

• Keyness can refer to difference (includingabsence) or similarity.

• Keyness refers to the size of the frequencydifference – not the level of confidence.

• The size of the frequency difference isindicated by the effect size score – not thestatistical significance score.

• Keyness analysis can be exploratory orfocused.

• Corpus selection should serve the studyaims.

Page 68: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Conclusions (2)

• Current KI selection techniques lack a soundrationale.

• Clustering CKIs provides a principled,transparent and replicable approach toselecting KIs for manual analysis.

• Pre-filtering can exclude large freq.diffs.and/or include smaller ones.

• Rankings by BIC and G2 values do notcorrespond!

• Lowering the stat.sig. threshold doesn’t justreturn more CKIs, but also different rankings!

Page 69: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Conclusions (3)

• Keyness is not a straightforward attribute.• A keyness analysis (or any quantitative

analysis) does not necessarily entail objectivity.– Decisions regarding thresholds (frequency,

effect-size, statistical significance) aresubjective …… and determine which and how many

items are deemed ‘key’.• A stricter stat.sig. threshold doesn’t

necessarily return items that are ‘more key’.

Page 70: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Recommendations

• Methodological decisions must be principled andexplicitly stated.

• Pre-filtering on the basis of frequency or POSshould be avoided, as it is tantamount to cherry-picking.

• BIC seems a more reliable metric of statisticalsignificance.

• For replicability, studies must report & justify:…any thresholds or pre-filtering;…the inclusion/exclusion of particular (types of)

CKIs;…the proportion of CKIs selected for analysis.

Page 71: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

Keyness Analysis: Steps

1. Formulate research questions orhypotheses.

2. Select or compile appropriate corpora.

3. Carry out frequency comparisons toidentify key items – using appropriatemetrics.

4. Carry out concordance analysis(i.e. explicit annotation).

5. Quantify the results.

6. Interpret results.

Page 72: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

References (1)• Andrew, D.P.S., Pedersen, P.M. & McEvoy, C.D. (2011) Research Methods

and Design in Sport Management. Human Kinetics.• Bestgen, Y. (2017) Getting rid of the Chi-square and Log-likelihood tests

for analysing vocabulary differences between corpora. Quaderns deFilologia: Estudis Lingüístics, 22, 33-56.

• Brown, D.W. (2018) English and Empire: Literary history, dialect, and the digital archive. Cambridge: Cambridge University Press.

• Duguid, A. & Partington, A. (2018) Using corpus linguistics to investigateabsence/s: You don’t know what you’re missing. Or do you? In Taylor, C.& Marchi, A. (eds.) Corpus Approaches to Discourse: A critical review.London: Routledge. 38-59.

• Gabrielatos, C. (2013a). Counting: Word frequency and beyond.Dubrovnik Fall School in Linguistic Methods. Centre for AdvancedAcademic Studies. Dubrovnik, Croatia, 20–26 October 2013. [e-copy]

• Gabrielatos, C. (2013b). Keyword analysis. Dubrovnik Fall School inLinguistic Methods. Centre for Advanced Academic Studies, Dubrovnik,Croatia, 20–26 October 2013. [e-copy]

Page 73: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

References (2)• Gabrielatos, C. (2014). Corpus approaches to discourse studies: The

basics. Discourse & Communication Studies Research Team, ÖrebroUniversity, 12 September 2014. [e-copy]

• Gabrielatos, C. (2017) Clusters of keyness: A principled approach toselecting key items. Corpus Linguistics in the South 15, University ofCambridge, 28 October 2017. [e-copy]

• Gabrielatos, C. (2018) Keyness analysis: Nature, metrics & techniques. InTaylor, C. & Marchi, A. (eds.) Corpus Approaches to Discourse: A criticalreview. London: Routledge.

• Gabrielatos, C. & Marchi, A. (2011) Keyness: matching metrics todefinitions. Corpus Linguistics in the South 1, University of Portsmouth, 5November 2011. [e-copy]

• Gabrielatos, C. & Marchi, A. (2012) Keywords: appropriate metrics andpractical issues. CADS International Conference, Bologna, Italy, 13-15September 2012. [e-copy]

• Gries, S.Th. (2010) Useful statistics for corpus linguistics. In: Sánchez, A. &Almela, M. eds. A mosaic of corpus linguistics: selected approaches.Frankfurt am Main: Peter Lang. 269-291.

Page 74: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

References (3)

• Hardie, A. (2014) Log ratio – an informal introduction. ESRC Centre forCorpus Approaches to Social Science CASS. [online]

• Hofland, K. & Johansson, S. (1982) Word Frequencies in British andAmerican English. Bergen: Norwegian Computing Centre for theHumanities/London: Longman.

• Kilgarriff, A. (2001) Comparing corpora. International Journal of CorpusLinguistics, 6(1), 97–133.

• Kilgarriff, A. (2009) Simple maths for keywords. In Mahlberg, M.,González-Díaz, V. & Smith, C. (eds.) Proceedings of the Corpus LinguisticsConference, CL2009. Liverpool: University of Liverpool. [e-copy]

• Mujis, D. (2010) Doing Quantitative Research in Education with SPSS.Sage.

• Partington, A. (2009) Evaluating evaluation and some concludingthoughts on CADS. In Morley, J. & Bayley, P. (eds.) Corpus-assisteddiscourse studies on the Iraq conflict: wording the war. London:Routledge, 261-303.

• Partington, A. (2014) Mind the gaps: the role of corpus linguistics inresearching absences. International Journal of Corpus Linguistics, 19(1),118–146.

Page 75: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

References (4)• Pojanapunya, P. & Watson Todd, R. (2016) Log-likelihood and odds ratio:

keyness statistics for different purposes of keyword analysis. CorpusLinguistics and Linguistic Theory, 14(1), 133-167.

• Ridge, E. & Kudenko, D. (2010) Tuning an algorithm using design ofexperiments. In Batz-Beiselstein, T., Chiarandini, M., Paquette, L. &Preuss, M. (eds.), Experimental Methods for the Analysis of OptimizationAlgorithms. Springer. 265-286.

• Rayson, P. (2009) Wmatrix: A web-based corpus processing environment,Computing Department, Lancaster University. [online]

• Rayson P., Berridge D. & Francis B. (2004) Extending the Cochran rule forthe comparison of word frequencies between corpora. In Purnelle G.,Fairon C. & Dister A. eds. Le poids des mots: Proceedings of the 7thInternational Conference on Statistical analysis of textual data JADT 2004,Vol.2. Louvain-la-Neuve, Belgium: Presses universitaires de Louvain, 926-936.

• Rosenfeld, B. & Penrod, S.D. (2011) Research Methods in ForensicPsychology. John Wiley and Sons.

Page 76: Keyness Analysis for (Critical) Discourse Studies · •The reference corpus does not need to be a general corpus. •The distinction between study and reference corpus is just one

References (5)• Scott, M. (2016) WordSmith Tools version 7. Stroud: Lexical Analysis

Software.• Taylor, C. (2011) Searching for similarity: The representation of boy/s and

girl/s in the UK press in 1993, 2005, 2010. Corpus Linguistics 2011,University of Birmingham, 20-22 July 2011.

• Taylor, C. (2013) Searching for similarity using corpus-assisted discoursestudies. Corpora 8(1), 81-113.

• Taylor, C. (2018) Similarity. In Taylor, C. & Marchi, A. (eds.) CorpusApproaches to Discourse: A critical review. London: Routledge. 19-37.

• Thompson, B. (1998) Statistical significance and effect size reporting:portrait of a possible future. Research in the Schools, 5(2), 33-38.

• Wilson, A. (2013) Embracing Bayes factors for key item analysis in corpuslinguistics. In Bieswanger, M. & Koll-Stobbe, A. (eds.) New Approaches tothe Study of Linguistic Variability. Frankfurt: Peter Lang. 3-11.