Uncertainty in measurement for 43 biochemistry ...static.probioqual.com/pdf/Doc/article LTCV...

download Uncertainty in measurement for 43 biochemistry ...static.probioqual.com/pdf/Doc/article LTCV PBQ.pdf · Matar et al.: Uncertainty in measurement estimates from external quality assessment

If you can't read please download the document

Transcript of Uncertainty in measurement for 43 biochemistry ...static.probioqual.com/pdf/Doc/article LTCV...

  • Clin Chem Lab Med 2015; aop

    Gladys Matar * , Bernard Poggi , Roland Meley , Chantal Bon , Laurence Chardon , Karim Chikh , Anne-Claude Renard , Catherine Sotta , Jean-Christophe Eynard , Regine Cartier and Richard Cohen

    Uncertainty in measurement for 43 biochemistry, immunoassay, and hemostasis routine analytes evaluated by a method using only external quality assessment data

    DOI 10.1515/cclm-2014-0942 Received September 24 , 2014 ; accepted February 19 , 2015

    Abstract

    Background: International organizations require from medical laboratories a quantitative statement of the uncertainty in measurement (UM) to help interpret patient results. The French accreditation body (COFRAC) recom-mends an approach (SH GTA 14 IQC/EQA method) using both internal quality control (IQC) and external quality assessment (EQA) data. The aim of this work was to vali-date an alternative way to quantify UM using only EQA results without any need for IQC data. This simple and practical method, which has already been described as the long-term evaluation of the UM (LTUM), is based on

    linear regression between data obtained by participants in EQA schemes and target values. We used it for 43 routine analytes covering biochemistry, immunoassay, and hemo-stasis fields. Methods: Data from 50 laboratories participating in Pro-BioQual (PBQ) EQA schemes over 25months were used to obtain estimates of the median and 90th percentile LTUM and to compare them to the usual analytical goals. Then, the two UM estimation methods were compared using data from 20 laboratories participating in both IQC and EQA schemes. Results: Median LTUMs ranged from 2.9% (sodium) to 16.3% (bicarbonates) for biochemistry analytes, from 12.6% (prothrombin time) to 18.4% (factor V) for hemosta-sis analytes when using the mean of all participants, and were around 10% for immunoassays when using the peer-group mean. Median LTUMs were, in most cases, slightly lower than those obtained with the SH GTA 14 method, whatever the concentration level. Conclusions : LTUM is a simple and convenient method that gives UM estimates that are reliable and comparable to those of recommended methods. Therefore, proficiency testing (PT) organizers are allowed to provide participants with an additional UM estimate using only EQA data and which could be updated at the end of each survey.

    Keywords: external quality assessment; internal quality control; long-term analytical coefficient of variation; pro-ficiency testing; uncertainty in measurement.

    List of abbreviations: AB, accuracy bias; AFP, -fetoprotein; AL, acceptable limit; ALAT, alanine ami-notransferase; ALP, alkaline phosphatase; ANOVA, analy-sis of variance; AP, all participant results; aPTT, activated partial thromboplastin time; ASAT, aspartate aminotrans-ferase; CB, constant bias; CEA, carcinoembryonic antigen; CK, creatine kinase; CV WL , within-laboratory coefficient

    *Corresponding author: Gladys Matar, ProBioQual, 9 rue Professeur Florence 69003, Lyon, France, E-mail: [email protected] Bernard Poggi and Chantal Bon: Service de Biochimie et Biologie mol culaire, H pital de la Croix-Rousse, Hospices Civils de Lyon, Lyon, France ; and ProBioQual, Lyon, France Roland Meley: Service d H matologie, St Etienne, France ; and ProBioQual, Lyon, France Laurence Chardon: Service de Biochimie et Biologie mol culaire, H pital Edouard Herriot, Hospices Civils de Lyon, Lyon, France ; and ProBioQual, Lyon, France Karim Chikh: Service de Biochimie et Biologie mol culaire, Centre Hospitalier Lyon Sud, Hospices Civils de Lyon, Pierre-B nite, France ; and ProBioQual, Lyon, France Anne-Claude Renard, Catherine Sotta and Jean-Christophe Eynard: ProBioQual, Lyon, France Regine Cartier: Service de Biochimie et Biologie mol culaire, Groupement Hospitalier Est, Hospices Civils de Lyon, Bron, France ; and ProBioQual, Lyon, France Richard Cohen: Service de Biochimie et Biologie mol culaire, H pital Edouard Herriot, Hospices Civils de Lyon, Lyon, France ; Universit Claude Bernard Lyon-1, ISPB Facult de pharmacie, MATEIS UMR CNRS 5510, France ; and ProBioQual, Lyon, France

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • 2Matar etal.: Uncertainty in measurement estimates from external quality assessment

    of variation; COFRAC, French accreditation body; DBMS, database management system; EQA, external quality assessment; EUROLAB, European Federation of National Associations of Measurement, Testing, and Analyti-cal Laboratories; FSH, follicle stimulating hormone; FT3, free triiodothyronine; FT4, free thyroxine; GGT, -glutamyltransferase; GUM, guide to the expression of uncertainty in measurement; IFCC, International Federa-tion of Clinical Chemistry and Laboratory Medicine; ILAC, International Laboratory Accreditation Cooperation; IOLM, International Organization of Legal Metrology; IQC, internal quality control; ISO, International Organization for Standardization; hCG, human chorionic gonadotropin; LCV, long-term analytical coefficient of variation; LDH, lactate dehydrogenase; LH, luteinizing hormone; LTB, long-term bias; LTUM, long-term uncertainty in measure-ment; PB, proportional bias; PBQ, ProBioQual; PG, peer group; ProT, prothrombin time; PSA, prostate-specific antigen; PT, proficiency testing; RE, random error; SE, sys-tematic error; SH GTA 14, COFRAC Accreditation Technical Guide for UM; s WL , within-laboratory standard deviation; TE, total error; TSH, thyroid stimulating hormone; um, standard uncertainty in measurement; UM, uncertainty in measurement (or expanded uncertainty in measurement for a 95% level of confidence).

    Introduction The National Institute of Standards and Technology speci-fies that: a measurement result is complete only when accompanied by a quantitative statement of its uncertainty. The uncertainty is required in order to decide if the result is adequate for its intended purpose and to ascertain if it is consistent with other similar results [1] . In clinical chemis-try, this is particularly important to ensure that test results are fit for their clinical purpose and do not compromise patient care. The uncertainty in measurement (UM) could, thereby, be used to assess the quantitative performances of analytical procedures [2] . Thus, medical laboratories are expected to estimate the uncertainty for test measure-ments, especially since many international organizations, such as International Federation of Clinical Chemistry and Laboratory Medicine (IFCC) [3] , International Labora-tory Accreditation Cooperation (ILAC) [4] , International Organization of Legal Metrology (IOLM) [5] and Interna-tional Organization for Standardization (ISO) [6, 7] [ISO 17025 (2005) and ISO 15189 (2012)], required it. According to the ISO 15189 (2012) 5.5.1.4 standard: The laboratory shall determine the uncertainty of measurement for each

    procedure in the analytical phase used to record the meas-ured values of patient samples .

    In the aim to provide terminology and methodology for expressing UM, a joint working group consisting of experts from these and other organizations published a Guide to the expression of uncertainty in measurement (GUM) [8] , which establishes the general rules for evaluating and expressing UM. The GUM presents different approaches to estimate UM using a mathematical theory and experimental observation, so-called modeling approaches . These methods were compared to alternative empirical approaches in the tech-nical report published by European Federation of National Associations of Measurement, Testing, and Analytical Lab-oratories (EUROLAB) [9] , which classifies these different approaches in two categories: the intra- and inter-laboratory approach. In the first category, estimation of the UM is based on the GUM modeling approaches or on individual data combinations available from reference methods and inter-nal quality control (IQC), providing an estimation of repeat-ability, within-laboratory reproducibility, and bias. In the second category, the UM estimation is based on validation method or proficiency testing (PT) data from the labora-tory s results obtained during participation in an external quality assessment (EQA).

    According to the empirical approaches , the UM is estimated from the imprecision and the bias. The impre-cision, in general, is quantified by the within-laboratory reproducibility standard deviation (s WL ), or coefficient of variation (CV WL ) obtained under within-laboratory repro-ducibility conditions (often called intermediate condi-tions ): different operators, reagent batches, calibrators, or long-time repetition. The bias is quantified by the devi-ations of measurement results from corresponding refer-ence values. Thus, a combination of the intralaboratory and the interlaboratory approaches is possible using IQC data to estimate the imprecision (s WL ), and the EQA labora-tory s results to estimate the bias. This combined method was suggested in the EUROLAB Technical report, and recommended in the SH GTA 14 [10] , the technical guide for the UM estimation, published by COFRAC (the French accreditation body) in 2011.

    As the UM is used to assess the performances of quan-titative analytical procedures, Meijer etal. [11] proposed to express the UM as the long-term analytical performance (LTUM) of a laboratory using only its EQA results over a long period of time. The imprecision is then expressed by the long-term analytical CV (LCV) and the deviation of the laboratory s result from the measurand assigned value is determined by the long-term bias (LTB).

    In order to provide participants with an estimate of the UM by a different method and to allow the PT organizer to

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • Matar etal.: Uncertainty in measurement estimates from external quality assessment3

    include UM estimation in the reports established at the end of each survey, we quantified the LTUM of a large panel of routine analytes assayed in a medical laboratory. We used data of 50 laboratories participating in ProBioQual (PBQ) EQA programs in biochemistry, immunoassay, and hemo-stasis over a long period of time (25 months). Moreover, data from 20 laboratories participating in both IQC and EQA schemes were used to compare the LTUM and the SH GTA 14 (IQC/EQA) UM estimation methods and to point out the advantages and the drawbacks of each approach.

    Materials and methods ProBioQual

    PBQ (http://www.probioqual.com) is a French non-profi t, non-gov-ernmental association established in 1972 by biologists to promote training and quality control in medical laboratories. As a PT provider, PBQ proposes six IQC and 26 EQAs in biochemistry, immunology, hematology, serology, and drug tests, including over 200 diff erent analytes. Thus, almost 1700 international medical laboratories par-ticipate in at least one of these programs, allowing an assessment of their analytical procedures.

    Survey specimens and laboratories

    Laboratories received, per year, human sera lyophilized specimens (24 for biochemistry, or 12 for immunoassay), or 16 human citrated plasmas lyophilized specimens for hemostasis EQA schemes. Intra- and inter-vials homogeneity tests results showed that lyophilization procedure was successfully applied but the recovery of this procedure was not tested. All survey samples were shipped at room temperature and then stored in the laboratories at 2 C/8 C until reconstitution. Simulation tests on the survey specimens have proved their stability during all the shipment process. According to an established sched-ule, the participants processed each specimen as if it was a patient sample, using routine methods to determine the concentration of each measurand. The main commercial methods, reagents and devices used by the laboratories to assess the EQA schemes samples are listed in the Table 1 . Aft er results submission on the PBQ website, under anonymous laboratory code number, statistical evaluation was performed according to the ISO guideline 13528 (2005) [12] , as it is recommended in the PT-specifi c standard ISO 17043 (2010) [13] . The ISO 13528 (2005) robust algorithm A was applied to calculate the mean of submitted results, which was used as the consensus value. No further statistical analysis of outliers was performed. The iterative process was terminated when the mean and CV estimates were sta-ble and the percentage of values modifi ed then did not exceed 15%. Thereby, two consensus values were determined: the fi rst one was the mean of all participants results (AP) and the second was that of laboratories using the same assay type of method, the same reagents and the same device, forming a peer group (PG). A minimum of six participants was required to apply the robust algorithm. The labora-tories received a PT report including the results of all participants,

    with an individual evaluation of the laboratory performance. PBQ provided advice on the interpretation of the statistical analysis and comments on participants performance.

    Each laboratory participating to PBQ EQA Schemes had an anonymous code number. The laboratories participating to this study were selected randomly from the beginning, the middle, and the end of our anonymous code number list. The new ones had the latest numbers and the older had the fi rst numbers. Like that, the selected laboratories represented experienced and non-experienced labora-tory in our EQA schemes. Fift y laboratories participating in PBQ bio-chemistry (915 participants in 2013), immunoassay (892 participants in 2013) or hemostasis (848 participants in 2013) EQA schemes over 25months (January 2011 until January 2013) were selected to test the LTUM method estimation. This prolonged period of time was suffi -cient to take into account a number of PT surveys covering a wide concentration range of measurands, hence allowing a real long-term analytical performance evaluation. Since these programs did not have the same survey frequency (weekly, monthly, or 2-monthly), between eight and 45 results were included in the evaluation of the LTUM for each laboratory. For the comparison between LTUM and SH GTA 14 (IQC/EQA) methods, data from 20 laboratories participating in both IQC (153 and 170 participants in biochemistry and hemostasis 2013 IQC schemes, respectively) and EQA programs, were used over the same period of time.

    LTUM method

    The LTUM method used the linear regression model (y = bx + a), as described by Meijer etal. [11] , to express the laboratory value (y) as a function of the target value (x), where b is the slope and a is the intercept point ( Figure 1 ). The total error (TE) is presented in Figure 1 as the diff erence between each data point (the laboratory value) and the identity line y = x (the target value). Thus, it could be assimilated to the accuracy bias (AB) or to the UM. The TE is composed of a random error (RE) and a systematic error (SE), itself composed of a constant (CB) and a proportional biases (PB). The RE corresponds to the disper-sion of the data points around the regression line caused by routine operating conditions variability (as multiple calibrators and reagent batches, multiple operators, equipment maintenance, etc.) over the EQA period of time, and is quantifi ed by the LCV. SE is represented by the deviation of the regression line from the identity line and cor-responds to the LTB. The CB (vertical distance between parallel and identity lines on Figure 1) is equal to the intercept a of the regression line at every level within the concentration range. It could be due to the so-called matrix-eff ect . The PB is represented, at each concen-tration level, by the diff erence between the regression and parallel lines. Its intensity increases with concentration level and could be due to calibration errors. The LCV and the LTB represent the two compo-nents of the UM, which could be determined as follows [11] :

    2 2UM 1.96 um 1.96 LCV LTB= = + (1)

    y/xsLCVx

    =

    (2)

    2 2 2x

    n 1 (b 1) s ( y x )nLTB

    x

    + =

    (3)

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • 4Matar etal.: Uncertainty in measurement estimates from external quality assessment

    Tabl

    e 1 :

    Bio

    chem

    istry

    , hem

    osta

    sis a

    nd im

    mun

    oass

    ay m

    ain

    com

    mer

    cial

    met

    hods

    and

    dev

    ices

    .

    AN

    AL

    YT

    ES

    ME

    TH

    OD

    S

    Bic

    arb

    on

    ates

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    YV

    itros

    Dev

    ice

    UV

    EN

    ZY

    MA

    TIC

    ME

    THO

    DS

    PH

    KIN

    ETI

    CS

    To

    tBili

    rub

    inE

    NZ

    YM

    ATI

    CS

    ME

    THO

    DS

    3 a

    lpha

    -HD

    S p

    hoto

    met

    ry

    Cal

    ciu

    m

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    YV

    itros

    Dev

    ice

    SE

    LEC

    TIV

    E E

    LEC

    TRO

    DE

    S

    SP

    EC

    TRA

    L P

    HO

    TOM

    ETR

    Y N

    M-B

    AP

    TA

    SP

    EC

    TRA

    L P

    HO

    TOM

    ETR

    Y

    O.C

    RE

    SO

    L P

    HTA

    LEIN

    SP

    EC

    TRA

    L P

    HO

    TOM

    ETR

    Y A

    RS

    EN

    AZ

    O II

    I

    Ch

    ole

    ster

    ol

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    YV

    itros

    Dev

    ice

    ES

    TER

    AS

    E O

    XY

    DA

    SE

    PE

    RO

    XY

    DA

    SE

    -

    chro

    mog

    en.

    phen

    olic

    (PA

    P)

    ES

    TER

    AS

    E O

    XY

    DA

    SE

    PE

    RO

    XY

    DA

    SE

    -

    chro

    mog

    en.

    non

    phen

    olic

    (PA

    P)

    Cre

    atin

    ine

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    Y

    V

    itros

    Dev

    ice

    JAF

    FE

    -S

    tand

    ardi

    zed

    IDM

    S -

    Cor

    rect

    ed

    JAF

    FE

    -S

    tand

    ardi

    zed

    IDM

    S N

    on c

    orre

    cted

    JAF

    FE

    -N

    on S

    tand

    ardi

    zed

    IDM

    S -

    Non

    Cor

    rect

    ed

    EN

    ZY

    MA

    TIC

    S M

    ETH

    OD

    S

    Iro

    nS

    PE

    CTR

    AL

    RE

    FLE

    CTO

    ME

    TRY

    Vitr

    osD

    evic

    e

    FE

    RE

    NE

    -D

    IRE

    CT

    FE

    RR

    OZ

    INE

    Pho

    tom

    etry

    Glu

    cose

    HE

    XO

    KIN

    AS

    E

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    YV

    itros

    Dev

    ice

    GO

    D -

    PO

    D "

    Trin

    der"

    GO

    D E

    LEC

    TRO

    DE

    Mag

    nes

    ium

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    Y V

    itros

    Dev

    ice

    SP

    EC

    TRA

    L P

    HO

    TOM

    ETR

    Y -

    AR

    SE

    NA

    ZO

    SP

    EC

    TRA

    L P

    HO

    TOM

    ETR

    Y C

    ALM

    AG

    ITE

    SP

    EC

    TRA

    L P

    HO

    TOM

    ETR

    Y -

    VA

    RIE

    D C

    OLO

    R

    EN

    ZY

    MA

    TIC

    S M

    ETH

    OD

    S

    Ph

    osp

    hat

    esS

    PE

    CTR

    AL

    RE

    FLE

    CTO

    ME

    TRY

    Vitr

    osD

    evic

    e

    Fer

    rous

    sulfa

    te re

    duci

    ngag

    ent

    PH

    OS

    PH

    OM

    OLY

    BD

    ATE

    FR

    EE

    RE

    DU

    CTI

    ON

    (U.V

    .)

    Cl/K

    /Na

    DIR

    EC

    T P

    OTE

    NTI

    OM

    ETR

    Y

    IND

    IRE

    CT

    PO

    TEN

    TIO

    ME

    TRY

    Tri

    gly

    ceri

    de

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    Y

    V

    itros

    Dev

    ice

    GP

    O P

    AP

    ME

    THO

    D -

    mea

    sure

    men

    t of

    tot

    al g

    lyce

    rol

    Ura

    te

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    Y

    V

    itros

    Dev

    ice

    UR

    ICA

    SE

    PE

    RO

    XY

    DA

    SE

    CH

    RO

    MO

    GE

    NE

    DIR

    .

    UR

    ICA

    SE

    PE

    RO

    XY

    DA

    SE

    CH

    RO

    MO

    GE

    NE

    + A

    OD

    UR

    ICA

    SE

    U.V

    . -29

    3 nm

    DIR

    EC

    TE

    Ure

    aS

    PE

    CTR

    AL

    RE

    FLE

    CTO

    ME

    TRY

    Vitr

    osD

    evic

    e

    UR

    EA

    SE

    U.V

    . -ki

    netic

    s

    UR

    EA

    SE

    with

    ELE

    CTR

    OD

    E -

    Con

    duct

    omet

    ry

    BIO

    CH

    EM

    IST

    RY

    AN

    AL

    YT

    ES

    AS

    SE

    SS

    ME

    NT

    SU

    PP

    LIE

    RD

    EV

    ICE

    SA

    BB

    OT

    T

    Dia

    gn

    ost

    icA

    rchi

    tect

    c80

    00

    Arc

    hite

    ct c

    4000

    BE

    CK

    MA

    NU

    niC

    elD

    xC60

    0 / 6

    00i

    Uni

    Cel

    DxC

    800

    AU

    400

    / AU

    480

    / A

    U 6

    40/

    AU

    680

    RO

    CH

    E

    Dia

    gn

    ost

    ics

    Mod

    ular

    P -

    PP

    -D

    P

    Cob

    as(c

    501-

    c502

    )C

    obas

    Inte

    gra

    400

    Cob

    asIn

    tegr

    a80

    0H

    itach

    i 717

    SIE

    ME

    NS

    Adv

    ia16

    50/1

    800

    Dim

    ensi

    on E

    xLD

    imen

    sion

    RxL

    HM

    D

    imen

    sion

    Vis

    ta

    Dim

    ensi

    on X

    pand

    TH

    ER

    MO

    S

    CIE

    NT

    IFIC

    Kon

    elab

    60/i

    and

    Kon

    elab

    Prim

    e 60

    /i

    Kon

    elab

    30/i

    and

    Kon

    elab

    Prim

    e 30

    /i

    OR

    TH

    O

    Clin

    ical

    Dia

    gn

    ost

    ic

    Vitr

    os5.

    1 F

    S

    Vitr

    os35

    0

    IMM

    UN

    OA

    SS

    AY

    AN

    AL

    YT

    ES

    AS

    SE

    SS

    ME

    NT

    SU

    PP

    LIE

    RD

    EV

    ICE

    S

    AB

    BO

    TT

    Dia

    gn

    osti

    cA

    rchi

    tect

    i100

    0/20

    00

    BE

    CK

    MA

    NU

    nice

    lDX

    i800

    -60

    0i

    Acc

    ess/

    DxI

    /DxC

    /Lxi

    BIO

    ME

    RIE

    UX

    Vid

    as

    RO

    CH

    E D

    iag

    no

    stic

    s

    Cob

    as e

    601

    (cob

    as 6

    000

    serie

    s)

    Mod

    ular

    E (

    170)

    -E

    E

    Cob

    as e

    602

    (cob

    as 8

    000

    serie

    s)

    Ele

    csys

    2010

    Cob

    as e

    411

    (cob

    as 4

    000

    serie

    s)

    SIE

    ME

    NS

    AD

    VIA

    Cen

    taur

    /Cen

    taur

    XP

    Dim

    ensi

    on V

    ista

    Imm

    ulite

    2000

    / 200

    0 X

    Pi

    TH

    ER

    MO

    SC

    IEN

    TIF

    ICK

    rypt

    or/K

    rypt

    orco

    mpa

    ct

    TO

    SO

    H b

    iosc

    ien

    ces

    AIA

    200

    0

    OR

    TH

    O C

    linic

    alD

    iag

    no

    stic

    sV

    itros

    5600

    HE

    MO

    ST

    AS

    IS A

    NA

    LY

    TE

    S A

    SS

    ES

    SM

    EN

    T

    SU

    PP

    LIE

    RD

    EV

    ICE

    SM

    ET

    HO

    DS

    RE

    AG

    EN

    TS

    Inst

    rum

    enta

    tion

    L

    abo

    rato

    ryF

    RA

    NC

    E

    AC

    L E

    LITE

    PR

    O

    (700

    0, 8

    000,

    900

    0)O

    PTI

    CA

    LD

    ETE

    CTI

    ON

    Hem

    osil

    Ant

    ithro

    mbi

    nH

    emos

    ilA

    ntith

    rom

    bin

    liqui

    deIL

    Hem

    osil

    Fib

    rinog

    enC

    laus

    sIL

    Hem

    ostil

    TP F

    ibH

    S P

    lus

    IL H

    emos

    ilR

    ecom

    bipl

    astin

    eIL

    Hem

    osil

    AP

    TT S

    PIL

    Hem

    osil

    Syn

    thas

    ilP

    lasm

    a de

    ficie

    nten

    fac

    teur

    V

    AC

    L TO

    P

    RO

    CH

    E

    Dia

    gn

    ost

    ics

    HIT

    AC

    HI,

    MO

    DU

    LAR

    OP

    TIC

    AL

    DE

    TEC

    TIO

    NA

    ntith

    rom

    bine

    C

    OB

    AS

    , IN

    TEG

    RA

    SIE

    ME

    NS

    Sys

    mex

    CA

    (CA

    -500

    , C

    A-1

    500,

    CA

    -700

    0)O

    PTI

    CA

    LD

    ETE

    CTI

    ON

    Ber

    ichr

    omA

    T II

    IIn

    nova

    nce

    Ant

    ithro

    mbi

    neP

    lasm

    a ex

    empt

    de

    fact

    eur V

    Mul

    tifib

    ren

    UD

    ade

    Thro

    mbi

    ne

    Thro

    mbo

    relS

    Inno

    vin

    (Thr

    omb.

    R

    ecom

    bina

    nte)

    Pat

    hrom

    tinS

    LA

    ctin

    FS

    SY

    SM

    EX

    CS

    (2

    000/

    2000

    i, 21

    00/2

    100i

    , 510

    0)B

    CT,

    BC

    S, B

    CS

    (X

    P)

    ST

    AG

    OS

    TA, S

    TA c

    ompa

    ct,

    STA

    -R, S

    TA-R

    E

    volu

    tion

    VIS

    CO

    SIM

    ETR

    ICA

    L D

    ETE

    CTI

    ON

    Sta

    chro

    mA

    T II

    IS

    TA d

    efic

    ient

    en f

    acte

    ur V

    Fib

    ripre

    stA

    utom

    ate

    STA

    Fib

    rinog

    en,

    No

    plas

    tine

    CI

    No

    plas

    tine

    CI P

    LUS

    CK

    Pre

    stP

    TT A

    utom

    ate

    Cep

    hasc

    reen

    Tri

    nit

    y B

    iote

    ch

    KC

    4, K

    C10

    , KC

    40M

    EC

    AN

    ICA

    L D

    ETE

    CTI

    ON

    Trin

    iclo

    tFib

    rinog

    ne

    Kit

    Trin

    iclo

    tAut

    omat

    ed

    AP

    TTTr

    inic

    lotA

    PTT

    HS

    Opt

    ion

    2, 4

    , 8D

    estin

    yM

    ax, D

    estin

    yP

    lus

    OP

    TIC

    AL

    DE

    TEC

    TIO

    NC

    oag-

    A-M

    ate

    XC

    AA

    NA

    LY

    TE

    SM

    ET

    HO

    DS

    AL

    AT

    / AS

    AT

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    Y V

    itros

    Dev

    ice

    IFC

    C M

    ETH

    OD

    S w

    ithP

    LPIF

    CC

    ME

    THO

    DS

    with

    outP

    LP

    Am

    ylas

    eS

    PE

    CTR

    AL

    RE

    FLE

    CTO

    ME

    TRY

    Vitr

    osD

    evic

    eN

    ITR

    OP

    HE

    NY

    L P

    OLY

    SA

    CC

    HA

    RID

    E N

    P-G

    3N

    ITR

    OP

    HE

    NY

    L P

    OLY

    SA

    CC

    HA

    RID

    E N

    P-G

    5/G

    6/G

    7

    CK

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    Y

    V

    itros

    Dev

    ice

    N-A

    CE

    TYL-

    CY

    STE

    INE

    IF

    CC

    Rec

    omm

    anda

    tion

    N-A

    CE

    TYL-

    CY

    STE

    INE

    oth

    ers

    GG

    T

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    Y

    V

    itros

    Dev

    ice

    NO

    N-C

    AR

    BO

    XY

    L S

    UB

    STR

    ATE

    C

    AR

    BO

    XY

    L S

    UB

    STR

    ATE

    C

    AR

    BO

    XY

    L S

    UB

    STR

    ATE

    IF

    CC

    Rec

    omm

    anda

    tion

    LD

    H

    SP

    EC

    TRA

    L R

    EF

    LEC

    TOM

    ETR

    Y

    V

    itros

    Dev

    ice

    PY

    RU

    VA

    TE S

    UB

    STR

    ATE

    -D

    GK

    CLA

    CTA

    TE S

    UB

    STR

    ATE

    -IF

    CC

    (L--

    >P)

    UV

    ME

    THO

    DS

    -IF

    CC

    (L-

    ->P

    )P

    YR

    UV

    ATE

    SU

    BS

    TRA

    TE -

    OTH

    ER

    S

    LIP

    AS

    ES

    PE

    CTR

    AL

    RE

    FLE

    CTO

    ME

    TRY

    Vitr

    osD

    evic

    eC

    OLO

    RIM

    ETR

    Y

    AL

    PS

    PE

    CTR

    AL

    RE

    FLE

    CTO

    ME

    TRY

    Vitr

    osD

    evic

    eA

    .M.P

    . TA

    MP

    ON

    (NO

    N I

    FC

    C/S

    FB

    C)

    A.M

    .P. T

    AM

    PO

    N (I

    FC

    C/S

    FB

    C)

    A A

    B C

    List

    of b

    ioch

    emis

    try m

    ain

    com

    mer

    cial

    met

    hods

    (A) a

    nd o

    f the

    mai

    n de

    vice

    s use

    d by

    the

    parti

    cipa

    nts t

    o as

    sess

    bio

    chem

    istry

    (A ),

    hem

    osta

    sis (

    B) a

    nd im

    mun

    oass

    ay (C

    ) ana

    lyte

    s. B

    esid

    es th

    e m

    ain

    devi

    ces a

    nd d

    etec

    tion

    met

    hods

    , rea

    gent

    s are

    list

    ed fo

    r hem

    osta

    sis a

    naly

    tes a

    sses

    smen

    t (B)

    .

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • Matar etal.: Uncertainty in measurement estimates from external quality assessment5

    combined data from IQC and calibration uncertainty. However, for convenient use in medical laboratories, the COFRAC recommends the third one which was well described by Fisicaro etal. [14, 15] . Briefl y, the in-house validation method or IQC data were used to determine imprecision, and the EQA laboratory data were used to calculate bias. Calculation could be made either using concentration values (s WL and bias expressed as concentration units) or using relative values (CV WL and bias expressed as a percentage of the assigned value) for the ran-dom and the systematic component of error, respectively. Here, both UM terms were expressed as percentages. Applying the uncertainty propagation law, an estimation of UM was given by the following equations:

    2 2UM 1.96 u 1.96 u(IQC) u(EQA)= = + (5)

    WLsu(IQC) CV 100m

    = =

    (6)

    22E

    | E |u(EQA) s3

    = +

    (7)

    where CV = coeffi cient of variation, s WL = within- laboratory standard deviation, m = IQC laboratory mean,

    = = = 2i ii i

    E(%) ii

    E (%) (E (%) E(%)) y xE(%) , S , E (%) 100n n 1 x

    and n = number of laboratory EQA results. As required, UM should be determined at two concentration

    levels at least corresponding to the critical clinical limits [16] . These limits were chosen as specifi ed by Vassault etal. [17] for biochemistry analytes and normal and pathologic levels were determined by PBQ for hemostasis analytes. Only the EQA control samples correspond-ing to the same concentration range as IQC samples were selected among those used during the 25-month surveys.

    Finally, the median UMs, calculated from 20 laboratories either by the SH GTA 14 method or by the LTUM approach, were compared at each of the two concentration levels.

    Results

    Biochemistry EQA schemes

    The results of 50 laboratories participating in the PBQ biochemistry EQA program were used to quantify LTUM for 23 routine analytes including ions, substrates, and enzymatic parameters, according to Equations 1 4. Two different target values were considered for either that of AP ( Figure2 A), or that of the PG (Figure 2B), except for the enzymatic analytes, for which only the PG target values were taken into account because of the high dispersion between AP results. To evaluate the performance of 50% and 90% of the laboratories, median and 90th percentile LTUM were calculated and compared (Figure 2) to two analytical goals commonly used by laboratories for the

    2

    y/x 21s (y y)

    (n 2 )(x x)(y y)

    (x x)

    =

    (4)

    where um = standard uncertainty in measurement, UM = expanded uncertainty in measurement for a 95% level of confi dence (k = 1.96 is the coverage factor), x = consensus value; x = mean value for x; y = laboratory value; y = mean value for y; s x = standard error of x; b = slope; s y/x = variability of the regression line, based on the least-squares method and n = number of laboratory EQA results.

    For each of the 50 laboratories chosen as described above, LTUM was evaluated using diff erent consensus values obtained from the AP or PG groups. Calculation of specimen number n, LCV, LTB, um, and expanded UM were realized by the database management sys-tem (DBMS) generated by the 4D soft ware (V. 2012). For each survey, any result deviating from the assigned value by > 3 standard devia-tions of the comparison group ( | z | > 3) was identifi ed as an outlier and discarded. This method is only valid for a normal distribution and the number of results removed was < 3. As expected, the hypoth-esis that data came from a normal distribution, appreciated by the 2 goodness of fi t and Kolmogorov-Smirnov tests, was not rejected when considering PG data and rejected when considering AP data for most analytes. The LTUM evaluation was performed only when n 8 results were available from a laboratory, except when EQA sam-ples had to be split in two concentration levels for the comparison study between the LTUM and SH GTA 14 methods (n 6). The reduc-tion in the minimum data number for linear regression analysis did not aff ect signifi cantly either the LTUM estimates or the conclusion of this part of our study.

    SH GTA 14 (IQC/EQA) method

    The SH GTA 14 suggests four diff erent methods for the UM assess-ment based on the GUM Modeling approach [8] , on data from method validation, on combined data from IQC and EQA, and on

    Regression line y = bx+ a

    Identity line y=x

    Consensus value x

    a

    EQA laboratory results y

    Parallel line

    RE

    TESE

    PB

    CB

    Figure 1: Linear regression model for the assessment of UM: y = bx + a. CB, constant bias; PB, proportional bias; RE, random error; SE, systematic error; TE, total error.

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • 6Matar etal.: Uncertainty in measurement estimates from external quality assessment

    validation method purpose. One was the expanded AB defined for the medium concentration level by Vassault etal. [17] from the state-of-the-art deduced from the expe-rience of several French EQA organizers in 1999 and the second was the desirable TE derived by Ricos etal. (http://www.westgard.com/biodatabase1.htm) from biological variations database. Analytical goals derived from the state-of-the-art are less strict than those defined by biolog-ical variations except for analytes whose biological vari-ations are very high [bilirubin, iron, triglyceride, alanine

    aminotransferase (ALAT), creatine kinase (CK), and lipase]. Median AP LTUM was comprised between 2.9% (sodium) and 16.3% (bicarbonates), while the median PG LTUM range was 2.4% (sodium) to 13.7% (bicarbonates). As expected, AP LTUM was higher than PG LTUM for the majority of analytes. For some analytes (cholesterol, iron, glucose, phosphate, potassium, sodium, and urea), close estimates of the median LTUM were obtained whatever the target value used. When the AP consensus mean was used (Figure 2A), 90% or more laboratories fulfilled both LTUM

    Bic

    arbo

    nate

    s

    Tot B

    iliru

    bin

    Cal

    cium

    Chl

    orid

    e

    Cho

    lest

    erol

    Cre

    atin

    ine

    Iron

    Glu

    cose

    Mag

    nesi

    um

    Pho

    spha

    te

    Pot

    assi

    um

    Sod

    ium

    Trig

    lyce

    ride

    Ura

    te

    Ure

    a

    19.6 23.5 4.5 4.9 13.7 17.6 19.6 9.8 11.8 11.0 6.9 3.5 15.7 13.7 15.7 16.3 13.8 6.2 6.2 7.1 12.8 6.0 6.2 10.0 5.4 3.5 2.9 10.1 6.4 8.9 23.3 24.5 9.7 9.0 10.4 18.7 12.0 10.1 13.4 8.7 5.3 4.6 15.4 13.4 15.1 4.9 26.9 2.6 1.5 9.0 8.9 30.7 7.0 4.8 10.1 5.6 0.7 26.0 12.0 15.6

    Bic

    arbo

    nate

    s

    Tot B

    iliru

    bin

    Cal

    cium

    Chl

    orid

    e

    Cho

    lest

    erol

    Cre

    atin

    ine

    Iron

    Glu

    cose

    Mag

    nesi

    um

    Pho

    spha

    te

    Pot

    assi

    um

    Sod

    ium

    Trig

    lyce

    ride

    Ura

    te

    Ure

    a

    ALA

    T

    Am

    ylas

    e

    AS

    AT

    CK

    GG

    T

    LDH

    Lipa

    se

    ALP

    19.6 23.5 4.5 4.9 13.7 17.6 19.6 9.8 11.8 11.0 6.9 3.5 15.7 13.7 15.7 17.6 17.6 17.6 17.6 17.6 17.6 17.6 17.6

    13.7 7.0 4.1 3.1 5.0 7.1 4.9 5.3 6.6 5.1 3.0 2.4 5.8 4.7 7.2 6.4 5.5 6.9 8.0 8.3 8.4 8.4 7.8

    20.8 12.0 5.7 5.6 6.6 11.1 10.3 7.3 10.1 8.2 4.6 3.6 8.6 5.9 12.5 11.7 9.1 10.8 11.8 14.1 15.2 13.3 15.1 4.9 26.9 2.6 1.5 9.0 8.9 30.7 7.0 4.8 10.1 5.6 0.7 26.0 12,0 15,6 27.5 14.6 16.7 30.3 22.1 11.4 37.9 12.0

    0%

    5%

    10%

    15%

    20%

    25%

    30%

    35%

    40% B

    0%

    5%

    10%

    15%

    20%

    25%

    30%

    35%

    40% VassaultVV AB Median LTUM LL X 90th percentile LTUM Ricos TE A

    Figure 2: Biochemistry analytes LTUM evaluation. Median (50th percentile) and 90th percentile (cross point) LTUM of 50 laboratories evaluated from AP (A), or from PG (B) target values, are compared to the Vassault etal. [17] intermediate concentration level AB, and to Ricos desirable TE (January 2014) for 23 routine analytes.

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • Matar etal.: Uncertainty in measurement estimates from external quality assessment7

    were not calculated by ANOVA, the difference between the AP-CV and the median PG-CV reflected the extent of between PG-biases and provided a good idea of analyte standardization. Table 2 shows that the lowest differences were obtained for -fetoprotein (AFP), prostate-specific antigen (PSA), human chorionic gonadotropin (hCG), thyroid stimulating hormone (TSH) and free triiodothy-ronine (FT3) with the more satisfactory standardiza-tion status at this concentration level. The UM for these 15 routine immunoassay analytes was estimated by the LTUM method described above ( Figure 3 ). Only the PG target value was considered here, since standardization of immunoassays is not presently sufficiently advanced to allow the use of the AP consensus mean without greatly overestimating LTB and, hence, AP LTUM. Except for FT3 and free thyroxine (FT4), intra- and inter-individual varia-tions are high, leading to a desirable TE (Figure 3) between 21% [CA 15-3, follicle stimulating hormone (FSH)] and 46% (CA 19-9). The AB established from the state-of-the-art for these analytes was also very high, reaching values between 24% and 29%. The median PG LTUMs were around 10% and did not exceed 12.5%, widely below both state-of-the-art and biological variation analytical goals. This is confirmed by the LTUM 90th percentile showing that over 90% laboratories fulfilled analytical goals for the majority of analytes, except for FT3 and FT4, which are very finely regulated in vivo and for TSH.

    Hemostasis EQA schemes

    Five hemostasis analytes [antithrombin, factor V, fibrin-ogen, prothrombin time (ProT), and activated partial thromboplastin time (aPTT)] were selected to determine LTUM by using 50 laboratory results oncoming from their participation in the PBQ hemostasis EQA scheme. As pre-viously, biological variation analytical goals were found in the Ricos et al. database (http://www.westgard.com/biodatabase1.htm), except for factor V. Since the hemo-stasis analytical goals derived from the state-of-the-art are not presently defined, we decided to compare median LTUM estimates to the acceptable limits (AL) set at PBQ by expert biologists in charge of the hemostasis EQA scheme. Median LTUMs were between 12.6% (ProT) and 18.4% (factor V) when the AP consensus value was used ( Figure 4 A) and between 6.4% (aPTT) and 17.1% (factor V) when the PG consensus value was used (Figure 4B). Except for aPTT, the use of the PG consensus mean lowered slightly median LTUM. Apart from fibrinogen, both LTUM analytical goals were not met by the majority of the 50 laboratories when the AP consensus value was used.

    analytical goals for total bilirubin, iron, phosphate, potas-sium, triglyceride, urate, and urea; for calcium and chlo-ride, both analytical goals were satisfied only by < 50% laboratories; for cholesterol, creatinine, glucose, and magnesium, the state-of-the-art LTUM analytical goal was fulfilled by about 90% laboratories, but < 50% laborato-ries reached the biological variation analytical goal; lastly for sodium, according to its very low biological variation, the state-of-the-art analytical goal was met by 50% labo-ratories and the biological variation analytical goal by far < 50% laboratories. When using PG consensus value (Figure 2B), even more laboratories reached the two ana-lytical goals since LTUM was lower for most analytes; for enzymes, almost all laboratories fulfilled the state-of-the-art analytical goal while the biological variation goal, was reached by > 90% laboratories for ALAT, amylase, aspar-tate aminotransferase (ASAT), CK, -glutamyltransferase (GGT), lipase and by 70% laboratories for lactate dehydro-genase (LDH) and alkaline phosphatase (ALP).

    Immunoassay EQA schemes

    A representative example of results obtained during one immunoassay EQA 2013 survey for 15 routine analytes, including tumor markers and hormones, is given Table2 . Although the between- and within-method variability

    Table 2 : An example of results obtained during one immunoassay EQA survey (2013).

    Analyte n n PG AP consensus mean

    AP CV, %

    MedianPG CV, %

    AFP, kUI/L 640 13 70.5 9 5CEA, g/L 666 13 17.5 12 5CA 125, kU/L 607 13 44.2 12 5CA 15-3, kU/L 630 13 30.4 15 6CA 19-9, kU/L 634 13 34.3 21 7PSA, g/L 795 14 5.0 8 5Free PSA, g/L 616 15 0.54 17 6FSH, UI/L 667 12 26.7 12 4LH, UI/L 732 12 16.2 14 5hCG, UI/L 1081 17 31.1 9 6Prolactin, mUI/L 578 14 507 16 6TSH, mUI/L 918 14 4.56 8 4Free T3, pmol/L 734 12 5.19 11 7Free T4, pmol/L 828 12 17.0 11 6Estradiol, pmol/L 738 14 563 13 8

    AP consensus mean and CV: mean and CV calculated from the n values using the ISO 13528 (2005) robust algorithm A. n, number of values; n PG, number of peer groups for each analyte; median PG CV, median of the n peer group CVs.

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • 8Matar etal.: Uncertainty in measurement estimates from external quality assessment

    AF

    P

    CA

    125

    CA

    15-

    3

    CA

    19-

    9

    CE

    A

    Est

    radi

    ol

    FS

    H

    hCG

    LH

    PS

    A

    Fre

    e P

    SA

    Pro

    lact

    in

    Fre

    e T

    3

    Fre

    e T

    4

    TS

    H

    !! 23.5 27.4 27.4 29.4 23.5 29.4 29.4 29.4 29.4 29.4 29.4 29.4 29.4 29.4 !! 11.6 11.0 10.7 11.1 12.5 11.8 9.7 10.4 9.6 11.3 12.4 11.4 11.6 10.8 10.6

    22.0 18.3 15.5 20.1 18.2 25.8 15.8 19.0 19.5 22.4 18.8 19.3 20.0 14.4 31.1 21.9 35.4 20.8 46.3 24.7 26.9 21.2 27.9 33.6 29.4 11.3 8.7 23.7

    0%

    5%

    10%

    15%

    20%

    25%

    30%

    35%

    40%

    45%

    50% assaultVV AB Median LTUM 90th percentile LTUM Ricos TE

    Figure 3: Immunoassay analytes LTUM evaluation. Median (50th percentile) and 90th percentile (cross point) LTUM of 50 laboratories evaluated from the peer group (PG) target values are compared to the Vassault etal. [17] intermediate concentration level AB, and to Ricos desirable TE (January 2014) for 15 tumor markers and hormones.

    Even when PG consensus value was used, the situation was improved only for aPTT and ProT LTUM compared to PBQ AL, but not with Ricos TE. For factor V, both AP LTUM (18.4%) and PG LTUM (17.1%) were higher than the only existing analytical goal (PBQ AL 12%). Figure 4 shows the relative importance of the two components (LCV and LTB) of AP and PG LTUM expressed as percentages. For aPTT, LTB decreased from 79% (AP) to 25% (PG), for ProT from 44% (AP) to 21% (PG), and for fibrinogen from 31% to 19% (Figure 4C, D). These three tests are chronometric, so the coagulation time depends on reagent types and detection methods, which could explain the LTB decrease when considering PG instead of AP consensus value. In con-trast, factor V and antithrombin AP and PG LTB percent-ages were almost the same (about 10%), in keeping with the closeness of AP and PG LTUM (Figure 4C, D). However, LCV percentages were very high for antithrombin and for factor V (about 90%), indicating the heterogeneity of within-laboratory assay conditions. Thus, the ratio LCV/LTB (with LTB calculated from AP consensus mean) could be considered as a measure of the relevance of applying AP or PG consensus value for bias estimation.

    SH GTA 14/LTUM comparison

    In order to compare the COFRAC recommended method (SH GTA 14 IQC/EQA) to the LTUM approach, data from 20 laboratories, participating to both PBQ IQC and EQA schemes in biochemistry and hemostasis, were used to estimate the UM of 19 routines analytes. For immunoas-says, comparison between the two UM method assess-ments was impossible because of insufficient IQC data. The within-laboratory CV (Equation 6) was estimated for each of 20 laboratories from 10 to 15 results sent monthly and cumulated over the 25-month period, at two concen-tration levels close to the upper and the lower limits of the reference range (biochemistry analytes) or corresponding to normal and pathological levels (hemostasis analytes). The SH GTA 14 UM and the LTUM were evaluated for each concentration level, using the AP or PG target value.

    For biochemistry analytes, SH GTA 14 UM was higher or similar to LTUM ( Figure 5 ) whatever the consen-sus mean (AP or PG) used. A slight decrease of UM was observed when the PG consensus mean was used instead of the AP consensus mean.

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • Matar etal.: Uncertainty in measurement estimates from external quality assessment9

    For hemostasis analytes, AP SH GTA 14 UM was slightly higher than AP LTUM, except for aPTT, at a normal level whereas they were very similar at a pathological level ( Figure 6 ). When PG target value was considered instead of AP results, SH GTA 14 UM and LTUM decreased for all the analytes, particularly for aPTT, at both concentration levels.

    Discussion The aim of this work was to validate a simple and practical method of calculating the UM applicable to medical labo-ratories using only EQA results without any need for IQC data. For this purpose, we estimated LTUM for 43 routine analytes using data from our EQA schemes and compared it to analytical goals commonly used in laboratory medi-cine (state-of-the-art and biological variations). This esti-mation was then evaluated by comparison with a method based on a combination of IQC and EQA results and rec-ommended by COFRAC: the SH GTA 14 approach.

    The LTUM approach is based on linear regression between laboratory data and comparison group means. Furthermore, the regression line parameters (slope and intercept) are estimations of proportional and constant biases respectively, allowing the use of control samples at different concentration levels. The quality of these estimates is better as the number of degrees of freedom (number of EQA samples minus two) increases. Con-versely with the SH GTA 14 method, control samples must be sorted according to their concentration level in order to avoid UM overestimation [s E term in Equation 7 represent-ing standard deviation of bias] due to PB if concentration units are used for calculation, or to CB if relative values (percentages) are used. Here, we observed an overestima-tion of SH GTA 14 UM mainly at a low concentration level (data not shown) when s WL and bias (expressed in concen-tration units) were used compared to UM obtained when relative values (CV WL and bias expressed as a percentage of the target value) were applied. This observation sug-gests the preponderance of a PB component in the EQA data processed.

    0%

    5%

    10%

    15%

    20%

    B

    0%

    5%

    10%

    15%

    20%Median LTUM PBQ AL Ricos TEA

    0%10%20%30%40%50%60%70%80%90%

    100%

    Antithromb III Factor V Fibrinogen ProT aPTT

    D

    0%10%20%30%40%50%60%70%80%90%

    100%

    Antithromb III Factor V Fibrinogen ProT aPTT

    LTB LCV C

    Antithromb III Factor V Fibrinogen ProT aPTT

    13.4% 18.4% 13.4% 12.6% 18.1%

    12.0% 12.0% 12.0% 10.0% 10.0%

    8.3% - 13.6% 5.3% 4.5%

    Antithromb III Factor V Fibrinogen ProT aPTT

    12.8% 17.1% 11.1% 9.2% 6.4%

    12.0% 12.0% 12.0% 10.0% 10.0%

    8.3% - 13.6% 5.3% 4.5%

    Figure 4: Hemostasis analytes LTUM. Median LTUMs of 50 laboratories evaluated from AP (A), or PG (B) consensus values are compared to PBQ AL and to Ricos desirable TE (January 2014). Contribution of the two LTUM components (LTB, LCV) for AP (C) and PG (D) results, expressed as percentages. ProT is expressed in % and aPTT in seconds.

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • 10Matar etal.: Uncertainty in measurement estimates from external quality assessment

    0%

    5%

    10%

    15%

    20%

    25%

    PG LTUM low concentration level PG SH GTA 14 low concentration level

    0%

    5%

    10%

    15%

    20%

    25%

    PG LTUM high concentration level PG SH GTA 14 high concentration level

    0%

    5%

    10%

    15%

    20%

    25%

    AP LTUM low concentration level AP SH GTA 14 low concentration level

    A

    Bic

    arbo

    nate

    s

    Tot B

    iliru

    bin

    Chl

    orid

    e

    Cho

    lest

    erol

    Cre

    atin

    ine

    Iron

    Pot

    assi

    um

    Sod

    ium

    Trig

    lyce

    ride

    Ura

    te

    16.5 20.9 7.9 8.2 19.4 12.5 5.2 3.3 10.4 6

    19.7 17.6 7.2 8.8 21.4 12.8 5.5 4.4 14.2 7.3

    B'

    Bic

    arbo

    nate

    s

    Tot B

    iliru

    bin

    Chl

    orid

    e

    Cho

    lest

    erol

    Cre

    atin

    ine

    Iron

    Pot

    assi

    um

    Sod

    ium

    Trig

    lyce

    ride

    Ura

    te

    ALA

    T

    AS

    AT

    CK

    GG

    T

    17.9 7.7 4.4 5.5 6.8 6.1 3.3 3.5 6.7 5.4 6.8 6.3 8.0 6.8

    18.9 11.9 5.0 7.8 10.7 7.7 4.4 4.0 10.0 6.0 10.4 8.7 13.2 9.2

    Bic

    arbo

    nate

    s

    Tot B

    iliru

    bin

    Chl

    orid

    e

    Cho

    lest

    erol

    Cre

    atin

    ine

    Iron

    Pot

    assi

    um

    Sod

    ium

    Trig

    lyce

    ride

    Ura

    te

    ALA

    T

    AS

    AT

    CK

    GG

    T

    14.0 12.5 3.9 5.9 9.8 8.0 3.7 2.6 6.8 4.2 8.7 9.3 14.6 10.0

    18.8 15.6 5.5 7.9 13.6 11.0 4.5 3.9 12.9 6.1 13.9 10.2 16.6 10.1

    A'

    0%

    5%

    10%

    15%

    20%

    25%

    AP LTUM high concentration level AP SH GTA 14 high concentration level

    B

    Bic

    arbo

    nate

    s

    Tot B

    iliru

    bin

    Chl

    orid

    e

    Cho

    lest

    erol

    Cre

    atin

    ine

    Iron

    Pot

    assi

    um

    Sod

    ium

    Trig

    lyce

    ride

    Ura

    te

    18.7 17.2 6.9 8.9 12.3 9.4 3.9 4.3 11.0 6.3

    22.5 15.2 7.4 9.4 17.0 9.4 5.4 4.3 12.4 8.9

    Figure 5: SH GTA 14/LTUM comparison. Median UMs of 20 laboratories for 14 routine biochemistry analytes determined by SH GTA 14 and LTUM methods at low (left) and high (right) concentration level, using AP (A and B) and PG (A and B ) results from their participation to PBQ IQC and EQA schemes.

    Several assumptions are necessary for the validity of the LTUM estimation: linearity between EQA laboratory values and target values, equality of variance in the con-centration range explored (homoscedasticity) and com-mutability of the control samples.

    The LTUM method has the advantage of fully taking into account the evolution of the bias as a function of con-centration. However, it provides a single estimate of the variability of the regression line (s y/x ) for the whole con-centration range explored by the control samples while the trend of standard deviation to vary with concentration is well known (increase for biochemistry and immunoas-say analytes and decrease for coagulation parameters). Therefore, the estimation of the variance obtained by this method could be reliable only for analytes whose clini-cally relevant concentration range is limited. Otherwise, for hormones such as hCG or TSH, it will be necessary to regroup control samples according to their concentration level as in the case of the SH GTA 14 method. As expected, our results (Figures 5 and 6) showed no significant differ-ence between LTUM at low and high concentration levels.

    In addition, UM values obtained by the two methods tested are very similar indicating that heteroscedasticity is not a limiting factor when applying the LTUM method to biochemistry and hemostasis routine measurands.

    Another important condition to demonstrate the potential of this UM estimate approach is the commut-ability of EQA samples with those of patients. Only in this case, the bias estimate will be reliable reflecting the discrepancies between measurement procedures and not the matrix related bias. Indeed, the most desirable PT/EQA programs are those that use commutable samples with target values assigned by a reference method (cat-egories 1 and 2 in reference [18] ). Although fresh samples of human origin are distributed in some of our programs, data used in this study came only from processed (mostly lyophilized and spiked or diluted to adjust the concentra-tion levels) and therefore likely non-commutable control materials. IQC samples, used for the SH GTA 14 method, must also be commutable to provide the best estimate of within laboratory precision. So in order to validate the estimation of the UM by the method proposed here, it is

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • Matar etal.: Uncertainty in measurement estimates from external quality assessment11

    necessary to characterize control samples with respect to their commutability. This could be done by comparing their behavior to a representative panel of patient samples for combinations of the most used measurement pro-cedures as described in the CLSI document EP30-A [19] . A limit of our study is that the commutability of control samples was not tested and therefore a matrix related bias could not be excluded for some diagnostic systems. In this case, all methods relying on EQA/IQC data to estimate the systematic component of TE, are likely to give incorrect UM values. For this reason, the LTUM estimates presently provided to participants in PBQ EQA schemes only use the peer group target values. An alternative could be to use the GUM Modeling approach which is a method less prone to this drawback but more difficult to implement in medical laboratories. However, our main goal in this work was not to obtain a reliable estimate of bias but to test a convenient method for calculating the UM and to compare it to a recommended method which also requires the com-mutability of samples.

    At last, the UM estimate is tightly dependent on the choice of the target value as shown by the different values

    observed when the homogeneous group (PG) mean is used instead of the mean of AP. Of course, the use of a TE estimate based on APs rather than on peer group EQA data is more appropriate to obtain valid conclusions regarding the suitability of routine methods for meeting TE goals for interpretation of patient results. However, the present standardization status of some analytes (tumor markers) requires the use of the same method (in the same labora-tory) for the monitoring of patients and justifies our bias estimation by comparison to the peer group. Furthermore, the UM estimates obtained using both references were in close agreement for many measurands except those for which standardization status is still a matter of concern (enzymes, immunoassays).

    In conclusion, this study allowed validation of UM calculation by a simple method that uses only the results of EQA schemes. It provided estimates comparable to methods recommended by official bodies and enabled biol-ogists to compare estimates obtained by different methods. As an EQA organizer, we provide all participants at the end of each survey, since January 2014, with an estimate of UM by this method using data from control samples assayed

    0%

    5%

    10%

    15%

    20%

    25%

    Fibrinogen ProT aPTT

    PG LTUM normal level PG SH GTA 14 normal level

    0%

    5%

    10%

    15%

    20%

    25%

    Fibrinogen ProT aPTT

    AP LTUM normal level AP SH GTA 14 normal level

    0%

    5%

    10%

    15%

    20%

    25%

    Fibrinogen ProT aPTT

    AP LTUM pathologic level

    AP SH GTA 14 pathologic level B

    11.3% 12.4%

    15.6% 14.8%

    15.9% 15.7%

    A

    11.7% 13.0%

    11.8%

    14.6%

    19.6%

    16.6%

    A'

    9.8%

    12.5% 10.8% 11.6%

    6.7%

    9.5%

    0%

    5%

    10%

    15%

    20%

    25%

    Fibrinogen ProT aPTT

    PG LTUM pathologic level PG SH GTA 14 pathologic level

    B'

    9.5% 11.7%

    9.2%

    11.8%

    5.7%

    9.3%

    Figure 6: SH GTA 14/LTUM comparison for hemostasis analytes. Median UMs of 20 laboratories for three hemostasis analytes estimated by the SH GTA 14 and LTUM approaches at normal and pathologic levels from AP (A and B) and PG (A and B ) results.

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM

  • 12Matar etal.: Uncertainty in measurement estimates from external quality assessment

    during the current year and the previous 2 years. The accu-mulation of results over time for a large panel of analytes and a great number of clinical laboratories will allow better appreciation of its advantages and disadvantages.

    Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission. Financial support: None declared. Employment or leadership: None declared. Honorarium: None declared. Competing interests: The funding organization(s) played no role in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the report for publication.

    References 1. Guidelines for evaluating and expressing the uncertainty of NIST

    measurement results. NIST Technical Note 1994:1297. 2. White GH, Farrance I. Uncertainty of measurement in quantitative

    medical testing a laboratory implementation guide. AACB Clin Biochem Rev 2004;25(Suppl):S124.

    3. Fuentes AX. Uncertainty of measurement in clinical microbiology. In: eJIFCC 2004;13. Available from: http://www.ifcc.org/ifccfiles/docs/130401006.pdf . Accessed November 2004.

    4. International Laboratory Accreditation Cooperation (ILAC). ILAC G-2:1994. Available from: http://www.ilac.org . Accessed Novem-ber 2004.

    5. Guide to the expression of uncertainty in measurement. Geneva: International Organization for Standardization (ISO), 1993.

    6. ISO 17025: General requirements for the competence of testing and calibration laboratories. Geneva: International Organization for Standardization (ISO), 2005.

    7. ISO 15189: Medical laboratories particular requirements for quality and competence. Geneva: International Organization for Standardization (ISO), 2012.

    8. Evaluation of measurement data Guide to the expression of uncertainty in measurement. JCGM 100:2008.

    9. Measurement uncertainty revisited: alternative approaches to uncertainty evaluation. European Federation of National Asso-ciations of Measurement, Testing and Analytical Laboratories, EUROLAB. Technical Report No. 1/2007.

    10. SH GTA 14: Technical guide for accreditation for the uncertainty measurement assessment in medical biology. Available from: http://www.cofrac.fr . Accessed 2011.

    11. Meijer P, de Maat MP, Kluft C, Haverkate F, van Houwelingen HC. Long-term analytical performance of hemostasis field methods as assessed by evaluation of the results of an external quality assessment program for antithrombin. Clin Chem 2002;48:1011 5.

    12. ISO 13528: Statistical methods for use in proficiency testing by interlaboratory comparisons. Geneva: International Organiza-tion for Standardization (ISO), 2005.

    13. ISO 17043: Conformity assessment general requirements for proficiency testing. Geneva: International Organization for Standardization (ISO), 2010.

    14. Fisicaro P, Amarouche S, Lalere B, Labarraque G, Priel M. Approaches to uncertainty evaluation based on proficiency testing schemes in chemical measurements. Accred Qual Assur 2008;13:361 6.

    15. Amarouche, S, Priel M, De Graeve J. Exploiting proficiency test-ing results, a new alternative to the evaluation of uncertainties: application in medical biology: dosage of glucose in human plasma. Workshop on the Impact of Information Technology in Metrology. BIPM-PTB, Berlin, 2007.

    16. SH GTA 06: Technical guide for accreditation: quality assess-ment in medical biology. Available from: http://www.cofrac.fr . Accessed 2012.

    17. Vassault A, Grafmeyer D, de Graeve J, Cohen R, Beaudonnet A, Bienvenu J. Analyses de biologie m dicale: sp cifications et normes d acceptabilit l usage de la validation de techniques. Ann Biol Clin 1999;57:685 95.

    18. Miller WG, Jones GR, Horowitz GL, Weykamp C. Proficiency test-ing/external quality assessment: current challenges and future directions. Clin Chem 2011;57:1670 80.

    19. CLSI. Characterization and qualification of commutable refer-ence materials for laboratory medicine; CLSI document EP30-A. Wayne, PA: Clinical and Laboratory Standards Institute, 2010.

    Authenticated | [email protected] author's copyDownload Date | 3/27/15 10:16 AM