23 Novl CA- Co Studies

download 23 Novl CA- Co Studies

of 59

Transcript of 23 Novl CA- Co Studies

  • 8/2/2019 23 Novl CA- Co Studies

    1/59

    Dr. P. P. DokeProfessor, Department of Preventive and Social Medicine

    MGM Medical College, Kamothe, Navi Mumbai

    M.D., DNB., Ph.D., FIPHA

  • 8/2/2019 23 Novl CA- Co Studies

    2/59

    The case-control study is an analyticepidemiologic research design in which thestudy population consists of groups whoeither have (cases) or do not have a particular

    health problem or outcome (controls)

    The investigator looks back in time tomeasure exposure of the study subjects. The

    exposure is then compared among cases andcontrols to determine if the exposure couldaccount for the health condition of the cases

  • 8/2/2019 23 Novl CA- Co Studies

    3/59

    Case-Referent

    Case-Compeer

    Retrospective ?

  • 8/2/2019 23 Novl CA- Co Studies

    4/59

    Observational / Non-experimental Occasionally Exploratory Explanatory (Analytical)

    Retrospective Effect to Cause Both Exposure & Disease have alreadyoccurred

    Uses Comparison Group

  • 8/2/2019 23 Novl CA- Co Studies

    5/59

    Consider some rare disease say some

    cancer (leukemia)

    Crude Annual Incidence = 3.4/100000 (< 15 years) Cohort Study: A year of observation on a million

    children to identify 34 cases

    Sample of 34 cases may be available in hospitals :

    may be sub-divided in 2 or more exposurecategories

    Easy to carry conduct case-control study

  • 8/2/2019 23 Novl CA- Co Studies

    6/59

    Long induction period between the

    exposure and clinical onset of disease

    Cohort Study: Waiting years for accrual of

    cases

    Case-Control Study: Compress time

    Case-Control Studies hence suitable for

    Chronic Diseases (Cancer / Cardiovascular

    Diseases)

  • 8/2/2019 23 Novl CA- Co Studies

    7/59

    RCT: Methodological Standard of ExcellenceHowever,

    Case-Control; Not only SIMPLE to performbut some times the ONLY approach to solve

    a problem. Philosophically no design is Gold

    Standard. Understand strengths and weaknesses . Select appropriate study design to address

    your Research Question

  • 8/2/2019 23 Novl CA- Co Studies

    8/59

    1. DirectionalityOutcome to exposure

    2. TimingRetrospective for exposure, but case-

    ascertainment can be either retrospectiveor concurrent

    3. Sampling

    Almost always on outcome, withmatching of controls to cases

  • 8/2/2019 23 Novl CA- Co Studies

    9/59

    Not Exposed Exposed Not Exposed

    Disease No Disease

    CASES CONTROLS

    Exposed

  • 8/2/2019 23 Novl CA- Co Studies

    10/59

    With a Specific Outcome: Presence of Disease / Syndrome

    Complications / progression of Disease (Severe dehydration

    crisis)

    Death (Neonatal mortality)

    Serum cholesterol / Birth weight

    Delayed Immunization

    Early Initiation of Cigarette Smoking

    Adverse Reactions of Drugs / Vaccines (SIDS)

    Behavior (Juvenile Delinquency) Drug Resistance (MDR-TB)

    Couple as a case (Infertility)

  • 8/2/2019 23 Novl CA- Co Studies

    11/59

    Diagnostic Criteria Risk of Disease Misclassification

    Continuous / Discrete Outcome Variable

    Relatively simple & straightforward: Children with cleft palates (physical

    examination)

    Sometimes difficult: Hypertension

    Diagnosis: Combination of methods

    Rationale / Logical

    Criteria Specific

    Operational versus Rigid

    Standard Definition (WHO, CDC, etc)

    Reference (growth references NCHS, CDC, New WHO)

  • 8/2/2019 23 Novl CA- Co Studies

    12/59

    Eligibility Criteria Inclusion/Exclusion criteria Ca-Co studies should be limited to incident cases :

    Exposures are presumably more recent and therefore more

    reliably recalled. Relatively homogeneous group

    Exclusion of prevalent cases: Minimize the Selection Bias(Neyman Fallacy).

    Ex: PID and IUD Use

    Women who are not sexually active or who have had atubal ligation are not likely to have recently used anycontraceptive method including IUDs

  • 8/2/2019 23 Novl CA- Co Studies

    13/59

    Conceptual definition Obesity defined as body fat percentage > 33%

    Operational definition Body Mass Index > 30

  • 8/2/2019 23 Novl CA- Co Studies

    14/59

    Case definition should avoid misclassification For example:

    Anemia was defined as Hemoglobin < 110gm/L as measured by WHO Colour Scale

    WHO Colour Scale over-estimates thehemoglobin

    Misclassified cases with mild anemia

    Also, studying mild forms of cases, gives larger

    case group; but misclassifies cases as non-cases OR non-cases as cases as early diagnosisis generally imprecise

  • 8/2/2019 23 Novl CA- Co Studies

    15/59

    A severe case definition may exclude peoplewho have been cured or who died ofdisease before the condition was severeenough to be labelled as case

    Standard/consensus definitions if available,must be used For example,

    Rheumatoid arthritis Rome criteria, NY criteria, 1987ARC criteria

    Lack of agreement over definition may introducevariability in estimates of effect

  • 8/2/2019 23 Novl CA- Co Studies

    16/59

    The issues of severity, diagnostic criteria andsubjectivity of criteria all lead to potential problemsof misclassification of cases

    The researcher can choose between morerestrictive and inclusive definitions

    Think in terms of sensitivity and specificity ofdefinition and its effect on validity, sample size,precision and power

    It is observed that; Restrictive definition (less sensitive) leads to lack of

    precision and power by reducing sample size Broad criteria (less specificity) produce misclassification

    leading to biased measure of effect So, weigh validity - specificity over sensitivity (Restrictive

    definition over inclusive definition)

  • 8/2/2019 23 Novl CA- Co Studies

    17/59

    Hospitals (Multi-Centric Studies)

    Community

    Industrial Population

  • 8/2/2019 23 Novl CA- Co Studies

    18/59

    The goal is to Ensure that all true cases have an equal probability

    of entering the study and that no false cases enter

    Example: Conceptual definition of HIV

    Factors affecting decision to test/access the test andSn & Sp of test will decide who eventually becomes acase under operational definition

    Selection bias ??

  • 8/2/2019 23 Novl CA- Co Studies

    19/59

    Selection bias Unequal chance of getting into study

    Berksons bias Variable rate of hospitalization affecting case

    selection Neyman fallacy

    Incident case Vs prevalent case

    Detection bias Due to closer medical attention, detection of

    endometrial cancer was more in a group usingestrogen

  • 8/2/2019 23 Novl CA- Co Studies

    20/59

    1. Representativeness:Ideally, cases should be a

    random sample of all cases ofinterest in the source population(e.g. from vital data, registry data).More commonly they are a selection

    of available cases from a medicalcare facility. (e.g. from hospitals,clinics)

  • 8/2/2019 23 Novl CA- Co Studies

    21/59

    2. Method of SelectionSelection may be from

    incidence or prevalence case:

    Incident cases are those derivedfrom ongoing-ascertainment ofcases over time.

    Prevalent cases are derivedfrom a cross-sectional survey.

  • 8/2/2019 23 Novl CA- Co Studies

    22/59

    Who is the best control? What universe

    should controls come from?

    If cases are a random sample of cases inthe population. Then controls should be a

    random sample of all non-cases in the

    population sampled at the same time.

  • 8/2/2019 23 Novl CA- Co Studies

    23/59

    Comparability is more important thanrepresentativeness in the selection of controls

    The control should be at risk of the disease The control should resemble the case in all

    respects except for the presence of disease(and any as yet undiscovered risk factors for

    disease)

  • 8/2/2019 23 Novl CA- Co Studies

    24/59

    Usually, cases in a case-control

    study are not a random sample of allcases in the population. And if so, the

    controls must be selected in the same

    way (and with the same biases) as the

    cases.

  • 8/2/2019 23 Novl CA- Co Studies

    25/59

    If follows from the above, that a pool ofpotential controls must be defined. This isa universe of people from whom controls

    may be selected (study base).

    Comparability vs. Representativeness

  • 8/2/2019 23 Novl CA- Co Studies

    26/59

    1. The study base Source of case and the control should be the same

    2. Deconfounding

    3. Comparable accuracy Similar misclassification errors in cases & controls Same potential of recall bias in cases & control

    4. Efficiency

  • 8/2/2019 23 Novl CA- Co Studies

    27/59

    Hospital or clinic control Dead control

    Controls with similar diseases

    Peer or case-nominated (friend/neighbor)control

    Population controls

  • 8/2/2019 23 Novl CA- Co Studies

    28/59

    Readily available hence commonly used Main reasons to use hospital controls are

    To select controls whose referral pattern is similarto cases

    To obtain similar quality of examination For convenience

    May not be representative of the population

  • 8/2/2019 23 Novl CA- Co Studies

    29/59

    Might use dead controls for dead cases In some situations, this might lead to use of

    surrogate informant The problem is the dead control is not

    representative of the living population McLaughlin compared dead controls withliving controls and noticed that the deadcontrols smoked more cigarettes andconsumed more alcohol than living controls

    Appropriateness depends on the exposurebeing studied

  • 8/2/2019 23 Novl CA- Co Studies

    30/59

    Reasons To minimize the recall bias

    To minimize the interviewer bias

    To examine the specificity of an exposure for a

    particular type of cancer For practical but unspecified reasons

    Problem ??

  • 8/2/2019 23 Novl CA- Co Studies

    31/59

    Neighborhood controls is used in two ways: To refer to community or population controls To refer to controls selected from finite number

    of close neighbors Search starts from house of the case and door-to-door

    search conducted for eligible controls in a standardizedpattern

    Friend or neighbor control is a surrogate formatching on age, education, etc A quick way to find control

    Bias is introduced if determinants of friendshipare associated with disease or exposure Friends share many risk behaviors

  • 8/2/2019 23 Novl CA- Co Studies

    32/59

    Randomly drawn from population Truly representative of population

    Ideal way of selecting controls

    Practically, very difficult to carry out Study base ???

  • 8/2/2019 23 Novl CA- Co Studies

    33/59

    Way the pros and cons Analyze the situation for bias being

    introduced

    If possible, select different sources of controls and compare

    with each other

    Compare the inferences drawn

  • 8/2/2019 23 Novl CA- Co Studies

    34/59

    Statistical consideration When the number of subjects available in one group

    (cases) is limited, an increase in the other groupincreases the study power

    Gain in power is till the ratio of 4:1 Thereafter, the gain is not substantial but cost

    increases When the study of power with equal allocation is as

    high as 0.9 or as low as 0.1, additional fails toincrease the power

  • 8/2/2019 23 Novl CA- Co Studies

    35/59

    Validity of inferences Even when there is no statistical need, more than

    one control may be recruited per case Enrolling two or more types of controls is a way of

    checking for biases introduced by choice of controlgroup

    If the measure of effect is similar when comparingcases with each control group Probably no biases (no surety)

    If different measure of effect, then the bias is thereand the researcher can understand it

  • 8/2/2019 23 Novl CA- Co Studies

    36/59

    MATCHING Purpose: To adjust - effects of relevant

    confounders

    Matching in Design - Accounted in Analysis

    Misconception: The goal is to make thecase and control groups similar in allrespects, except for disease status

    An Optimal Matching Scheme involves only

    those variables which improve statisticalefficiency or eliminate bias from the effectof interest

  • 8/2/2019 23 Novl CA- Co Studies

    37/59

    MATCHING Which variables are appropriate for matching?

    Risk factors from prior work may be identifiedfor matching

    Matching by interviewer or hospital may be usedto balance out the effects of interviewer andobserver errors

    It is best to limit matching to basic descriptors

    (age, sex, socio-economic status, etc) Non-modifiable risk factors

    Use few matching factors

  • 8/2/2019 23 Novl CA- Co Studies

    38/59

    MATCHINGOverzealous matching may have adverse

    effects:

    Matching on a strong correlate of theexposure, which is not an independent riskfactor for the outcome (overmatching) maylead to an underestimate of OR

    Matching may lead to a false sense ofsecurity that a particular variable isadequately controlled

  • 8/2/2019 23 Novl CA- Co Studies

    39/59

    1. Control selection is usually throughmatching.

    Matching variables (e.g. age), andmatching criteria (e.g. within the same 5

    year age group) must be set up in advance.

  • 8/2/2019 23 Novl CA- Co Studies

    40/59

    2.Controls can be individually matched (most

    common) or Frequency matched.

    Individual matching: search for one (or more)controls who have the required matching

    criteria, paired (triplet) matching is when thereis one (two) control (s) individually matched to

    each cases.

    Frequency matching: select a population ofcontrols such that the overall characteristics ofthe case, e.g. if 15% cases are under age 20,

    15% of the controls are also

  • 8/2/2019 23 Novl CA- Co Studies

    41/59

    3. Avoid over-matching, match only onfactors KNOWN to be cause of the disease.

    4. Obtain POWER by matching MORE THANONE CONTROL per case. In general, N ofcontrols should be < 4, because there isno further gain of power above that.

    5. Obtain Generalizability by matching bymatching more than one type of control.

  • 8/2/2019 23 Novl CA- Co Studies

    42/59

    Various soft wares are available

  • 8/2/2019 23 Novl CA- Co Studies

    43/59

    Questionnaires Records

    Conversion tables/algorithms

  • 8/2/2019 23 Novl CA- Co Studies

    44/59

    Questionnaire Question comprehension Information retrieval Response formulation and recording

    Quality of exposure reports may beinfluenced by Type of respondent Administration of questionnaire Salience of exposure Way in which information is retrieved Ways in which responses are formulated and

    recorded

  • 8/2/2019 23 Novl CA- Co Studies

    45/59

    Records Abstraction of data from record

    Quality control measures are important

    Careful design and testing of abstraction form

    Training and supervision of abstractors Priori definition of terms

    Specifications of rules for handling conflicting ormissing data

  • 8/2/2019 23 Novl CA- Co Studies

    46/59

    FIRST: Select

    CASES CONTROLS(With Disease) (Without Disease)

    THEN: Were exposed a b

    Measure

    Exposure Were not exposed c d

    TOTALS a + c b + d

    Proportions a bExposed a + c b + d

  • 8/2/2019 23 Novl CA- Co Studies

    47/59

    ac ad

    Odds Ratio = =

    b bcd

    Risk = a

    a + b= c

    c + d

    a b

    c d

    Case Control

    E+

    E-

  • 8/2/2019 23 Novl CA- Co Studies

    48/59

    Case

    Exposed UnexposedExposed Both Mixed

    Controls

    Unexposed Mixed Neither

  • 8/2/2019 23 Novl CA- Co Studies

    49/59

    For one control

    Case

    Exposed UnexposedExposed r s

    Controls

    Unexposed t u

    McNemar 2=(t+s)2/(t-s)

  • 8/2/2019 23 Novl CA- Co Studies

    50/59

    Stroke Control totalHypertension 30 10 40

    No hypertension 70 90 160

    Total 100 100 200

    30x9010x70

    Odds ratio = = 3.86

  • 8/2/2019 23 Novl CA- Co Studies

    51/59

    CaseHypertension No hypertension Total

    Hypertension

    Control

    No hypertension

    2 8 10

    28 62 90

    Total 30 70 100

    McNemar 2=(t+s)2/(t-s) =(28+8)2/(28-8)= 34.61

  • 8/2/2019 23 Novl CA- Co Studies

    52/59

    Advantages:1. Only realistic study design for

    uncovering etiology in rare diseases2. Important in understanding newdiseases3. Commonly used in outbreaks

    investigation4. Useful ifinducing period is long5. Relatively inexpensive

  • 8/2/2019 23 Novl CA- Co Studies

    53/59

    1. Susceptible to bias if not carefully

    designed

    2. Especially susceptible to exposure

    misclassification

    3. Especially susceptible to recall bias

    4. Restricted to single outcome

    5. Incidence rates not usually calculate6. Cannot assess effects ofmatching

    variables

  • 8/2/2019 23 Novl CA- Co Studies

    54/59

    Dolls 1952 study of smoking and lungcancer. The problem was that the controlpopulation ( lung disease) was biased in

    relation to the exposure. McMahons 1981 study of coffee and

    pancreatic cancer. Problem was that someof the controls may have been biased in

    relation to the exposure, because diseasesrelated to coffee were excluded from thecontrol series.

  • 8/2/2019 23 Novl CA- Co Studies

    55/59

    1950s Cigarette smoking and lung cancer

    1970s Diethyl stilbestrol and vaginal

    adenocarcinoma

    Post-menopausal estrogens and endometrial

    cancer

  • 8/2/2019 23 Novl CA- Co Studies

    56/59

    1980 s Aspirin and Reyes sydrome

    Tampon use and toxic shocks syndrome

    L-tryptopham and eosinophilia-myalgiasyndrome AIDS and sexual practices1990s Vaccine effectiveness Diet and cancer

    Famous Examples and discoveries

  • 8/2/2019 23 Novl CA- Co Studies

    57/59

    The odds ratio is a good estimate of the

    relative risk when the disease is rare

    (prevalence 1 controls

    Statistical testing is by simple chi-square

    (unmatched analysis) or by McNemars

    chi- square (matched-pairs analysis) Can be extended to multiple strata

    ( Mantel-Haenzel chi-square)

  • 8/2/2019 23 Novl CA- Co Studies

    58/59

  • 8/2/2019 23 Novl CA- Co Studies

    59/59

    Thank you