Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

33
Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM

Transcript of Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Page 1: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Appraising A Diagnostic Test

Clinical Epidemiology and Evidence-based Medicine Unit

FKUI-RSCM

Page 2: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

What is diagnosis ?Increase certainty about presence/absence of diseaseDisease severityMonitor clinical courseAssess prognosis – risk/stage within diagnosisPlan treatment e.g., location Stalling for time!

Knottnerus, BMJ 2002

Page 3: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Key Concept

Pre-test Probability– The probability of the target

condition being present before the results of a diagnostic test are available.

Post-test Probability– The probability of the target

condition being present after the results of a diagnostic test are available.

Page 4: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Key Concept

Pre-test Probability– The probability of the target condition

being present before the results of a diagnostic test are available.

Post-test Probability– The probability of the target condition

being present after the results of a diagnostic test are available.

Page 5: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.
Page 6: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Basic Principles (1)

Ideal diagnostic tests – right answers:(+) results in everyone with the

disease and( - ) results in everyone elseUsual clinical practice:–The test be studied in the same

way it would be used in the clinical setting

Observational study, and consists of:–Predictor variable (test result)–Outcome variable (presence /

absence of the disease)

Page 7: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Basic Principles (2)

Sensitivity, specificityPrevalence, prior probability, predictive valuesLikelihood ratiosDichotomous scale, cutoff points (continuous scale)Positive (true and false), negative (true & false)ROC (receiver operator characteristic) curve

Page 8: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

General structure : 2 X 2 table

Target disorderPositive(disease)

Target disorderNegative (normal)

PredictorTest

positive

True positiveTPa

False positiveFPb

PredictorTest

negative

False negative

FNc

True negativeTNd

Page 9: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Disease(+)

Disease(-)

Total

Test (+)True pos

a

False posb

a+b

Test (-)False neg

c

True negd

c+d

Total a+c b+d a+b+c+d a+c

a+b+c+dPrevalence Pretest probability

Page 10: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Sensitivity

The proportion of people who truly have a designated disorder who are so identified by the test.Sensitive tests have few false negatives. When a test with a high Sensitivity is Negative, it effectively rules out the diagnosis of disease. SnNout

Page 11: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Specificity

The proportion of people who are truly free of a designated disorder who are so identified by the test. Specific tests have few false positivesWhen a test is highly specific, a positive result can rule in the diagnosis. SpPin

Page 12: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Disease(+)

Disease(-)

Totals

Test (+) a b a+b

Test (-) c d c+d

Totals a+c b+da+b

+c+d a/a+c d/b+d

Probability of positive test result in patients with the disease

Probability of negative test result

in patients without the disease

Sensitivity Specificity

SnNOut SpPIn

Page 13: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

SnNOutThe sensitivity of dyspnea on exertion for the diagnosis of CHF is 100% (41/(41+0)), and the specificity 17% (35/(183+35)).

If DOE, it is very unlikely that they have CHF (0 out of 41 patients with CHF did not have this symptom).

"SnNOut", which is taken from the phrase: "Sensitive test when Negative rules Out disease".

Page 14: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

SpPinConversely, a very specific test, when positive, rules in disease. "SpPIn"! 

The sensitivity of gallop for CHF is only 24% (10/41), but the specificity is 99% (215/218).  Thus, if a patient has a gallop murmur, they probably have CHF (10 out of 13).

Page 15: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Iron deficiency anemiaTotals

Present Absent

Diagnostic

test result (Serum ferritin)

(+)<65

mmol/L

731a

270b

1001 a+b

(-)>65

mmol/L

78c

1500d

1578 c+d

Totals809 a+c

1770 b+d

2579

a+b+c+d

Sensitivity=a/a+c=90%Specificity =d/b+d=85%

Pos predictive value=a/a+b=73%Neg predictive value=d/c+d=95%

LR + = sn/(1-sp)=90/15=6

Prevalence=(a+c)/(a+b+c+d)= 32%

Posttest odd =Pretest odd xLikelihood Ratio

Predictor

Outcome

Page 16: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Odds = ratio of two probabilities Odds = p/1-p Probability = odds/1+odds

Likelihood ratio (+):Prob (+) result in people with the

diseaseProb (+) result in people w/out the

disease

Pretest Odds X LR = Posttest Odds

Page 17: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Key Concept

Likelihood Ratio– Relative likelihood that a given test

would be expected in a patient with (as opposed to one without) a disorder of interest.

probability of the test result in pts without disease

LR=probability of a test result in pts with disease

Page 18: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Likelihood ratios (LR) General Rules of Thumb

LR > 10 or < 0.1 produce large changes in pre-test probabilityLR of 5 to 10 or 0.1 to 0.2 produce moderate changesLR of 1 to 2 or 0.5 to 1 produce small changes in pre-test probability

Page 19: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

TestTest

CA B

pretest probability

0 .10 .20 .30 .40 .50 .60 .70 .80 .90 1

do not test

do nottreat

do not test

do nottreat

do not test

get on with treatment

do not test

get on with treatment

Likelihood ratio

posttest probability

TestTest

+ = Sn/(1-Sp)(1-Sn)/Sp= -

PreTest odds x LR

pretest probability

Page 20: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Serum ferritin (mmol/L)

Iron def positiveIron def negative Likelih

ood ratio

Diagnostic impact

No % No %

Very positive <15 47459

(474/809)20

1.1(20/1770)

52Rule inSpPin

Moderately positive

15-34 17522

(175/809)79

4.5(79/1770)

4.8IntermedHigh

Neutral 35-64 8210

(82/809)171

10(171/1770)

1Indeter mine

Moderately negative

65-94 303.7

(30/809)168

9.5(168/1770)

0.39Intermed low

Extremely negative

>95 485.9

(48/809)1332

75(1332/1770)

0.08Rule outSnNout

809100

(809/809)1770

100(1770/1770)

The usefulness of five levels of a diagnostic test result

Page 21: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Pretest probability

Likelihood ratio

Posttest probability

Page 22: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

T4 value Hypothyroid

Euthyroid

5 or less 18 1

5.1 – 7.0 7 17

7.1 – 9.0 4 36

9 or more 3 39

Totals 32 93

T4 value Hypothyroid

Euthyroid

≤ 5 18 1

> 5 14 92

Totals 32 93

T4 value Hypothyroid

Euthyroid

≤ 7 25 18

> 7 7 75

Totals 32 93

T4 value Hypothyroid

Euthyroid

≤ 9 29 54

> 9 3 39

Totals 32 93

Cutoff point

Sens Spec

5 0.56 0.99

7 0.78 0.81

9 0.91 0.42

T4 level in suspected hypo-thyroidism in children

For tests / predictors with continuous values result , cutoff points should be determine to choose the best value to use in distinguishing those with and without the target disorder

Page 23: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Cutoff point

Sens Spec

5 0.56 0.99

7 0.78 0.81

9 0.91 0.42

Cutoff point

SensTP

1-SpecFP

5 0.56 0.01

7 0.78 0.19

9 0.91 0.58

Page 24: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Accuracy of the test

The accuracy of the test depends on how well the test separates the group being tested into those with and without the disease in questionAccuracy is measured by the area under the ROC curve. An area of 1 represents a perfect test; an area of 0.5 represents a worthless test (AUC)

• 0.90-1.00 = excellent (A)

• 0.80-0.90 = good (B)

• 0.70-0.80 = fair (C) • 0.60-0.70 = poor (D) • 0.50-0.60 = fail (F)

Page 25: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

An ROC curve demonstrates several things:

It shows the tradeoff between sensitivity and specificity

• any increase in sensitivity will be accompanied by a decrease in specificity

The closer the curve follows the left-hand border and then the top border of the ROC space, the more accurate the test. The closer the curve comes to the 45-degree diagonal of the ROC space, the less accurate the test. The slope of the tangent line at a cutoff point gives the likelihood ratio (LR) for that value of the test.

Page 26: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Appraising DxTest

Is the evidence valid? (V)• Was there an independent, blinded comparison with a gold standard?

• Was the test evaluated in an appropriate spectrum of patients?

• Was the reference standard applied regardless of the test result?

• Was the test validated in a second, independent group of patients?

Page 27: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Can I trust the accuracy data?

RAMMbo

Recruitment: Was an appropriate spectrum of patients included? (Spectrum Bias)Maintainence: All patients subjected to a Gold Standard? (Verification Bias)Measurements: Was there an independent, blind or objective comparison with a Gold standard? (Observer Bias; Differential

Reference Bias)

Guyatt. JAMA, 1993

Page 28: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Critical AppraisalIs this valid test important? (I)

• Distinguish between patients with and those without the disease

• Two by two tables• Sensitivity and Specificity

–SnNOut–SpPIn

• ROC curves• Likelihood Ratios

Page 29: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.
Page 30: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Critical Appraisal

Can I apply this test to my patient (A)

• Similarity to our patient• Is it available• Is it affordable• Is it accurate• Is it precise

Page 31: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Critical Appraisal

Can I apply this test to my patient?

• Can I generate a sensible pre-test probability–Personal experience–Practice database–Assume prevalence in the study

Page 32: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Critical Appraisal

Diagnosis– Can I apply this test to a specific patient

• Will the post-test probability affect management

– Movement above treatment threshold– Patient willing to undergo testing

Page 33: Appraising A Diagnostic Test Clinical Epidemiology and Evidence-based Medicine Unit FKUI-RSCM.

Thank You