€¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across...

60
Title: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples Authors: Emma C. Johnson 1 *, Sandra Sanchez-Roige 2 *, Laura Acion 3 , Mark J. Adams 4 , Kathleen K. Bucholz 2 , Grace Chan 5 , Michael J. Chao 6 , David B. Chorlian 7 , Danielle M Dick 8,9 , Howard J. Edenberg 10,11 , Tatiana Foroud 11 , Caroline Hayward 12 , Jon Heron 13 , Victor Hesselbrock 5 , Matthew Hickman 13 , Kenneth S. Kendler 14 , Sivan Kinreich 7 , John Kramer 3 , Sally I-Chun Kuo 8 , Samuel Kuperman 3 , Dongbing Lai 11 , Andrew M. McIntosh 4 , Jacquelyn L. Meyers 7 , Martin H. Plawecki 15 , Bernice Porjesz 7 , David Porteous 16 , Marc A Schuckit 2 , Jinni Su 17 , Yong Zang 18 , Abraham A. Palmer 2,19 , Arpana Agrawal 1 , Toni-Kim Clarke 4 **, Alexis C. Edwards 14 ** *Joint first authors; **Joint senior authors Affiliations: 1 Washington University School of Medicine, Department of Psychiatry, St. Louis, MO, USA 2 University of California San Diego, Department of Psychiatry, San Diego, CA, USA 3 University of Iowa, Carver College of Medicine, Department of Psychiatry, Iowa City, IA, USA 4 University of Edinburgh, Division of Psychiatry, Edinburgh, UK 5 University of Connecticut School of Medicine, Department of Psychiatry, Farmington, CT, USA 6 Icahn School of Medicine at Mt. Sinai, Department of Neuroscience, New York, NY, USA 7 Suny Downstate Medical Center, Department of Psychiatry, Brooklyn, NY, USA 8 Virginia Commonwealth University, Department of Psychology, Richmond, VA, USA 9 Virginia Commonwealth University, Department of Human and Molecular Genetics, Richmond, VA, USA 10 Indiana University School of Medicine, Department of Biochemistry and Molecular Biology, Indianapolis, IN, USA 11 Indiana University School of Medicine, Department of Medical and Molecular Genetics, Indianapolis, IN, USA

Transcript of €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across...

Page 1: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Title: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples Authors:Emma C. Johnson1*, Sandra Sanchez-Roige2*, Laura Acion3, Mark J. Adams4, Kathleen K. Bucholz2, Grace Chan5, Michael J. Chao6, David B. Chorlian7, Danielle M Dick8,9, Howard J. Edenberg10,11, Tatiana Foroud11, Caroline Hayward12, Jon Heron13, Victor Hesselbrock5, Matthew Hickman13, Kenneth S. Kendler14, Sivan Kinreich7, John Kramer3, Sally I-Chun Kuo8, Samuel Kuperman3, Dongbing Lai11, Andrew M. McIntosh4, Jacquelyn L. Meyers7, Martin H. Plawecki15, Bernice Porjesz7, David Porteous16, Marc A Schuckit2, Jinni Su17, Yong Zang18, Abraham A. Palmer2,19, Arpana Agrawal1, Toni-Kim Clarke4**, Alexis C. Edwards14** *Joint first authors; **Joint senior authors

Affiliations:1Washington University School of Medicine, Department of Psychiatry, St. Louis, MO, USA2University of California San Diego, Department of Psychiatry, San Diego, CA, USA3University of Iowa, Carver College of Medicine, Department of Psychiatry, Iowa City, IA, USA4University of Edinburgh, Division of Psychiatry, Edinburgh, UK5University of Connecticut School of Medicine, Department of Psychiatry, Farmington, CT, USA6Icahn School of Medicine at Mt. Sinai, Department of Neuroscience, New York, NY, USA7Suny Downstate Medical Center, Department of Psychiatry, Brooklyn, NY, USA8Virginia Commonwealth University, Department of Psychology, Richmond, VA, USA9Virginia Commonwealth University, Department of Human and Molecular Genetics, Richmond, VA, USA10Indiana University School of Medicine, Department of Biochemistry and Molecular Biology, Indianapolis, IN, USA11Indiana University School of Medicine, Department of Medical and Molecular Genetics, Indianapolis, IN, USA12University of Edinburgh, Institute of Genetics and Molecular Medicine, MRC Human Genetics Unit, Edinburgh, UK13University of Bristol, Bristol Medical School, Population Health Sciences, Bristol, UK14Virginia Commonwealth University School of Medicine, Department of Psychiatry, Richmond, VA, USA15 Indiana University School of Medicine, Department of Psychiatry, Indianapolis, IN, USA16University of Edinburgh, Institute of Genetics & Molecular Medicine, Centre for Genomic and Experimental Medicine, Edinburgh, UK17Arizona State University, Department of Psychology, Tempe, AZ, USA.18Indiana University School of Medicine, Department of Biostatistics, Bloomington, IN, USA19University of California San Diego, Institute for Genomic Medicine, San Diego, CA, USA

Corresponding author: Emma C Johnson Email: [email protected]: 660 S. Euclid, CB 8134Saint Louis, MO 63110, USA

Page 2: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Word Count: 4,963ABSTRACT

BACKGROUND

Studies suggest that alcohol consumption and alcohol use disorders have distinct genetic

backgrounds.

METHODS

We examined whether polygenic risk scores (PRS) for consumption and problem subscales of

the Alcohol Use Disorders Identification Test (AUDIT-C, AUDIT-P) in the UK Biobank (UKB;

N=121,630) correlate with alcohol outcomes in four independent samples: an ascertained

cohort, the Collaborative Study on the Genetics of Alcoholism (COGA; N=6,850), and

population-based cohorts: Avon Longitudinal Study of Parents and Children (ALSPAC;

N=5,911), Generation Scotland (GS; N=17,461), and an independent subset of UKB

(N=245,947). Regression models and survival analyses tested whether the PRS were

associated with the alcohol-related outcomes.

RESULTS

In COGA, AUDIT-P PRS was associated with alcohol dependence, AUD symptom count,

maximum drinks (R2=0.47% - 0.68%, p=2.0e-8 - 1.0e-10), and increased likelihood of onset of

alcohol dependence (Hazard Ratio =1.15, p=4.7e-08); AUDIT-C PRS was not an independent

predictor of any phenotype. In ALSPAC, the AUDIT-C PRS was associated with alcohol

dependence (R2=0.96%, p=4.8e-6). In GS, AUDIT-C PRS was a better predictor of weekly

alcohol use (R2=0.27%, p=5.5e-11), while AUDIT-P PRS was more associated with problem

drinking (R2=0.40%, p=9.0e-7). Lastly, AUDIT-P PRS was associated with ICD-based alcohol-

related disorders in the UKB subset (R2=0.18%, p<2.0e-16).

2

Page 3: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

CONCLUSIONS

AUDIT-P PRS was associated with a range of alcohol-related phenotypes across population-

based and ascertained cohorts, while AUDIT-C PRS showed less utility in the ascertained

cohort. We show that AUDIT-P is genetically correlated with both use and misuse and

demonstrate the influence of ascertainment schemes on PRS analyses.

3

Page 4: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

INTRODUCTION

Alcohol use, across the lifespan, can be characterized by a series of transitions: from early

experimentation to hazardous drinking and possible development of alcohol use disorders

(AUD), or to non-problematic alcohol intake. Some of these transitions are developmentally

salient: for instance, hazardous drinking is common during adolescence and early adulthood,

but does not always portend problematic use in later adulthood (Degenhardt et al., 2013;

Wennberg, Andersson, & Bohman, 2000). Similarly, individuals with AUD might remit, even to

abstinence, or persist into later life (McCutcheon et al., 2012; Trim, Schuckit, & Smith, 2013).

Furthermore, alcohol-related behavior can be broadly disarticulated into two components – the

extent to which an individual consumes alcohol and the potential problems that they experience

related to their intake. For instance, AUD are characterized not by excessive alcohol

consumption but by measurable physiological changes that accompany addiction (e.g.

withdrawal, tolerance) as well as loss of control over drinking and drinking despite physical and

emotional impairment (American Psychiatric Association, 2013). These complexities underlying

the life course of alcohol use and misuse pose challenges in the dissection of etiology.

Most stages of alcohol involvement include a heritable contribution (twin h2 = 40-70%), although

the magnitude of these genetic effects varies considerably across development (Edwards et al.,

2017; Enoch, 2006; Pagan et al., 2006). Common genetic variants from genome-wide

association studies (GWAS) explain 4-13% of the phenotypic variance in alcohol use and

misuse (Clarke et al., 2017; Kranzler et al., 2019; Liu et al., 2019; Sanchez-Roige et al., 2017;

Schumann et al., 2016; Walters et al., 2018). The largest GWAS of alcohol dependence to date

(Kranzler et al., 2019) suggests that the genetic correlation between alcohol consumption (units

per week) and ICD coded AUD is variable (rg ranging from 0.54 (beer/cider) to 0.004

(champagne/white wine)).

4

Page 5: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

A recent study (Sanchez-Roige et al., 2018) conducted GWAS of both the consumption and

problem subscales of the Alcohol Use Disorders Identification Test (AUDIT), a short screener

for drinking in the past year (Saunders et al.,1993). In the UK Biobank sample, genetic liability to

the consumption subscale (AUDIT-C; three items with information pertaining to alcohol

consumption) was positively correlated with educational achievement and unrelated to

psychopathology whereas liability to the problem subscale (AUDIT-P; seven items with

information pertaining to alcohol problems) was negatively correlated with educational

achievement and positively correlated with psychopathology. These findings are consistent with

the lack of genetic correlation between psychiatric illness and genetic liability to alcohol

consumption in the largest GWAS of the trait (drinks per week; Liu et al., 2019). In contrast, two

recent studies suggest moderate genetic correlations between AUD and consumption indices,

including the AUDIT-C (e.g., rg = 0.52, p = 2.40e-42) (Kranzler et al., 2019; Marees et al., 2019),

while another study found that polygenic risk scores (PRS) for past week alcohol consumption

predicted a modest but significant amount of variance in AUD (e.g., R2 = 0.56%; Johnson et al.,

2019), suggesting that the genetic correspondence between recent consumption and

dependence may be complicated by several factors, including the characteristics of the sample,

and the nature of the assessment (e.g., alcohol quantity vs. frequency (Marees et al., 2019)).

In this study, we sought to examine the extent to which PRS derived from the AUDIT-C and

AUDIT-P GWAS predicted variance in multiple aspects of alcohol use and misuse, ranging from

levels of alcohol consumption, hazardous drinking, and AUD, in four independent samples that

vary in their age and ascertainment scheme. PRS are scores that represent an individual’s

genetic liability for a certain trait or disorder, created by aggregating the effects of many risk

variants for the phenotype of interest, weighting the effect sizes by the number of effect alleles

an individual carries at each locus. Prior evidence indicates that association between PRS and

phenotype in the target sample is improved when both the discovery and target samples for 5

Page 6: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

PRS analyses are derived using similar ascertainment strategies (Savage et al., 2018). Based

on a study showing positive correlations between psychopathology and AUDIT-P (but not

AUDIT-C; Sanchez-Roige et al., 2018), we hypothesized that AUDIT-P PRS would be more

closely related to liability to AUD than would AUDIT-C PRS, which would be more closely

related to aspects of alcohol consumption (e.g., regular consumption, units per week). We also

hypothesized that associations with AUDIT-C would be stronger in the youngest sample while

the AUDIT-P would be more predictive of drinking in older, ascertained samples in which

problem drinking is more established. While there have been some recent studies examining

the genetic overlap between alcohol consumption and indices of problem drinking (e.g.,

Johnson et al., 2019), none have yet compared the performance of consumption (AUDIT-C)

versus problem drinking (AUDIT-P) PRS across multiple samples. Taken together, the current

analyses demonstrate how genetic findings derived from a simple and fast screening tool could

serve to outline the polygenic underpinnings of different stages of alcohol use and problems in

diversely ascertained samples.

MATERIALS AND METHODS

Discovery GWAS

The summary statistics of the recent GWAS of AUDIT-C and AUDIT-P in 121,604 unrelated

individuals of European ancestry from the UK Biobank cohort (Sanchez-Roige et al., 2018) were

used to construct PRS for alcohol-related outcomes in four independent target samples. The UK

Biobank (UKB, www.ukbiobank.ac.uk) is a population-based sample of 502,629 participants

who were recruited from 22 assessment centers across the United Kingdom from 2006 to 2010

(Bycroft et al., 2018); the AUDIT was delivered as part of the online Mental Health

Questionnaire follow-up (Davis et al., 2018). Discovery GWAS analyses were performed using

BGENIE (Bycroft et al., 2018), version 1.1, with AUDIT scores (AUDIT-C score, and AUDIT-P 6

Page 7: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

score, tested independently) as the outcome variable and age, sex, genotyping array, and the

first 20 principal components derived from genotype data as covariates to account for any

remaining population stratification. Further details regarding the discovery GWAS are provided

in the Supplemental Materials.

Target sample demographics and characteristics

Collaborative Study on the Genetics of Alcoholism

The Collaborative Study on the Genetics of Alcoholism (COGA) (Begleiter et al., 1995;

Nurnberger et al., 2004; Schuckit et al., 2018) includes probands meeting criteria for alcohol

dependence, their family members, and community control families. There were 7,645

participants of European ancestry with genotype data available as well as data on alcohol-

related phenotypes; of these, 6,850 were ≥20 years (the mean age of onset of alcohol

dependence) and reported lifetime alcohol use.

Avon Longitudinal Study of Parents and Children

The Avon Longitudinal Study of Parents and Children (ALSPAC) recruited 14,541 pregnant

women residing in Avon, UK, with expected dates of delivery April 1, 1991, to December 31,

1992. The enrolled sample consisted of 14,775 children. Additional details are provided in the

Supplemental Materials. For the current analyses, phenotypic data were available for up to

5,911 participants, depending on the phenotype and wave of assessment. Ethical approval for

the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research

Ethics Committees.

Generation Scotland

The Scottish Family Health Study (GS) is a family-based cohort recruited from the general

population of Scotland from 2006-2011 (N=24,084) (Smith et al., 2013). A subset of GS 7

Page 8: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

participants responded to a re-contact in 2015 (Navrady et al., 2018) and provided additional

information on mental health, which included information on alcohol misuse (N = 17,461).

UK Biobank

Individuals who were part of the AUDIT discovery GWAS (Sanchez-Roige et al., 2018) were

removed from the UKB target sample, leaving 245,947 individuals with phenotypic data for

these analyses.

Alcohol use measures

A summary of the alcohol-related outcomes included in the analyses is presented in Table 1.

Across all samples, participants who did not endorse lifetime alcohol use were removed to

account for those with high genetic liability who elect not to drink for personal or socio-cultural

reasons, or were not old enough to have had a first drink. Each target sample used a different

assessment. Details regarding each assessment are available in the Supplementary

Materials. Briefly, COGA participants were administered the Semi-Structured Interview for the

Genetics of Alcoholism (SSAGA; (Bucholz et al., 1994)), which includes diagnostic assessments

for DSM-IV and DSM-5 AUD as well as other aspects of alcohol use. ALSPAC participants

completed the AUDIT questionnaire, and DSM-IV alcohol dependence was derived for 4,328

participants, using symptom-level items adapted from (Kendler, et al., 1992). In GS, participants

reported on their alcohol intake (units per week) using a pre-clinical questionnaire, and were

also administered the CAGE (Cut-Annoyed-Guilty-Eye) screener (Ewing, 1984). In the UKB,

ICD-9 and 10 codes for AUD were derived from linkage to hospital inpatient records. Phenotypic

correlations between alcohol use measures across the samples are shown in Supplementary

Table 16.

Genotypes8

Page 9: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Details on genotyping and quality control are shown in the Supplementary Materials.

Polygenic risk score construction

For all samples, PRS were coded for every individual by multiplying an individual’s number of

effect alleles at a particular SNP by that SNP’s effect size (beta) from the discovery GWAS

(Sanchez-Roige et al., 2018) then averaging across SNPs to create one score per person. The

discovery GWAS summary statistics were clumped using PLINK (Chang et al., 2015; Purcell et

al., 2007) using the linkage disequilibrium (LD) pattern from the 1000 Genomes European

(Consortium, 2015) reference sample, with an LD threshold of r2≥0.25 and a 500 kb physical

distance. PRS were constructed for 8 thresholds (pT<0.0001, pT<0.001, pT<0.01, pT<0.10,

pT<0.20, pT<0.30, pT<0.40, pT<0.50) and standardized using the scale function in R within each

sample to ease interpretation of effect sizes. For ALSPAC and COGA, PRS were generated

using PLINK; for UKB and GS samples, PRS were created using PLINK implemented in PRS-

ice2 (Euesden et al., 2014) using the same R2 thresholds for clumping and the same p-value

thresholds.

Statistical Analyses

Linear and logistic regression models were used to examine whether AUDIT-C and AUDIT-P

PRS were associated with each of the alcohol-related phenotypes (see Table 1), and to

determine which PRS threshold (i.e., pT) was most predictive of each measure based on the p-

value and observed (linear) or pseudo (logistic) R2. For the cohorts of unrelated individuals

(ALSPAC and UKB), the partial R2 was extracted from linear regression models for continuous

traits, while Nagelkerke’s pseudo-R2 was extracted from logistic regression models for binary

traits. For the cohorts that employed mixed-effect models to account for within-sample

relatedness (COGA and GS), variance explained by the PRS in the continuous outcomes (e.g.

MaxDrinks, CAGE) was calculated by multiplying the PRS by its regression coefficient and 9

Page 10: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

dividing the variance of that value by the variance of the outcome to derive a coefficient of

determination between 0 and 1 (Nakagawa & Schielzeth, 2012); in COGA, the ‘MuMIn’ package

in R was used to calculate marginal R2 for the logistic mixed-effect models for the binary

outcomes (Barton, 2011). Once the most predictive AUDIT-C and AUDIT-P PRS thresholds

were determined in the single-PRS models (n.b. pT was allowed to vary between AUDIT-C and

AUDIT-P PRS), the most predictive PRS for both AUDIT-C and AUDIT-P PRS were

simultaneously entered into a joint regression model along with covariates (results for single-

PRS models are in available in Supplemental Table 1-12). We primarily report on the results of

the joint (AUDIT-C PRS + AUDIT-P PRS) regression models, as we wished to examine the

relative contribution of each AUDIT subscale PRS while controlling for the other subscale PRS.

As the primary analyses consisted of 2 tests (AUDIT-C PRS and AUDIT-P PRS) for each of the

12 outcomes, and the number of independent tests across the PRS p-value thresholds was

estimated to be approximately 5 (calculated using spectral decomposition, via the matSpD.R R

script (Nyholt, 2004)), we corrected for 120 tests using a Bonferroni p-value = 0.0004.

COGA: Sex, 4 ancestral principal components (chosen via visual inspection of a scree plot of

the eigenvalues), total number of interviews, birth cohort (born 1890-1929, 1930-1949, 1950-

1969, ≥1970), and array type (see Supplemental Materials) were included as fixed effects,

while the family identifier and recruitment site were included as random effects (family nested

within site).

ALSPAC : In keeping with standard practice for genetic analyses in ALSPAC, sex and 10

ancestral principal components were included as covariates. ALSPAC assessments are age-

specific, therefore age was not included as a covariate.

10

Page 11: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

GS: Analyses were performed in AS-REML-R fitting pedigree information as a random effect to

control for relatedness in the sample. Fixed effects included age, sex and 4 multidimensional

scaling components to correct for population stratification.

UKB: Age, sex and 10 ancestry principal components were used as covariates.

Cox proportional hazards survival models for onset of hazardous drinking (AUDIT total score

≥8) in ALSPAC and DSM-IV alcohol dependence in COGA were fit to the data with identical

covariates as the regression analyses. Those who did not meet criteria for hazardous drinking

or alcohol dependence were censored at their age at last interview (since age at birth). For ease

of comparison across the two samples, the same PRS threshold (pT < 0.5) was used in both

COGA and ALSPAC and for both the AUDIT-C and AUDIT-P PRS. Violations of the proportional

hazards assumption for the PRS were tested using scaled Schoenfeld residuals. All analyses

were conducted in R (R Core Team, 2017). In COGA, family identifier and recruitment site were

included in the survival models as a nested frailty term (i.e. `(1 | site/family)`), using the “coxme”

package for mixed-effect Cox proportional hazards models containing Gaussian random effects

(Therneau, 2018). ALSPAC data were analyzed using the “survival” and “survminer” packages

in R (Kassambara et al., 2017; Therneau & Lumley, 2015)

For illustrative purposes, PRS were also split into quartiles for COGA, ALSPAC, and UKB, and

the odds of having an AUD diagnosis (DSM-IV alcohol dependence in COGA and ALSPAC,

ICD-9/ICD-10 alcohol related codes in UKB) was calculated in each quartile using the lowest

quartile as the reference group.

RESULTS

Target sample demographics and characteristics11

Page 12: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

The prevalence of each alcohol use measure for each target sample is shown in Table 1. The

SNP-heritability of AUDIT-C was 11%, while the SNP-heritability of the AUDIT-P score was 9%,

and the genetic correlation (i.e., SNP-rg) between them was rg = 0.70 (as reported in the original

paper (Sanchez-Roige et al., 2018)). See Supplemental Table 15 for the number of SNPs in

each PRS threshold.

COGA: 33.8% of the participants (N = 2,318) met DSM-IV criteria for alcohol dependence

during their lifetime, with 77.6 % of those endorsing ≥6 (i.e., severe) DSM-5 AUD criteria. A

median of 14.0 drinks (SD = 12.9; ~96 grams of alcohol, or 14 standard US drinks) in a single

24-hour period was reported (i.e., MaxDrinks). 90.0% of the participants reported drinking at

least once a month for 6 consecutive months or longer (i.e., monthly alcohol use).

ALSPAC: By age 23, 11.2% (N = 484) had met DSM-IV criteria for alcohol dependence within

the past year at any of the three assessment waves, while 66.2% reported hazardous drinking

(AUDIT total score ≥8). The median MaxDrinks was 15 UK units (SD = 12.4; approximately 120

grams of alcohol, or 8.5 standard US drinks).

GS: Mean alcohol intake was 10.9 units per week (SD = 12.8; approximately 87.2 grams of

alcohol, or 6.2 standard US drinks; N = 17,461) and the mean CAGE score was 0.6 (SD = 0.93).

UKB: There were 4,141 (1.68%) individuals in the subset of UKB selected for analysis who had

at least one ICD-9 or ICD-10 code assigned as a primary or secondary diagnosis of alcohol

related disorders during time as a hospital inpatient.

Associations between AUDIT PRS and consumption measures

COGA: As shown in Table 2, only the AUDIT-P PRS was significantly associated with monthly

alcohol use (R2= 0.65%, p = 1.24e-4) and MaxDrinks (R2= 0.47%, p = 2.00e-8). We also

converted the pseudo-R2 values for all binary outcomes to R2 on the liability scale (Lee et al.,

2012), but as estimates did not differ, these results are not presented.12

Page 13: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

ALSPAC: AUDIT-C and AUDIT-P PRS accounted for increasing proportions of phenotypic

variance in AUDIT-C and AUDIT-P scores as the sample aged, from ~0.50% at age 16 to

1.12% at age 23 (see Supplementary Tables 8-9 for complete results). Here we focus on

results for age 23 for comparability to other samples, which consisted predominantly of adults.

No association survived the correction for multiple tests. The strongest associations observed

were for AUDIT-C and AUDIT-P PRS, which were both associated with higher AUDIT-C scores

in ALSPAC at age 23, each explaining 0.60% of the variance (p = 1.06e-3 and 5.9e-4,

respectively), and between AUDIT-P PRS and MaxDrinks (R2= 3.30%, p = 1.59e-3); a 1-SD

increase in AUDIT-P PRS corresponds to a predicted increase of 0.54 drinks.

GS: The AUDIT-P PRS explained 0.15% of the variance in units per week (p = 4.7e-7) and

0.40% of the variance in CAGE scores (p = 9.0e-7), while the AUDIT-C PRS explained 0.27% of

the variance in units per week (p = 5.5e-11).

Associations between AUDIT PRS and alcohol problem measures

COGA: As shown in Table 2, only the AUDIT-P PRS was significantly associated with DSM-IV

alcohol dependence (R2 = 0.68%, p = 4.55e-9) and DSM-5 AUD symptom count (R2 = 0.67%, p

= 1.01e-10). For individuals in the top quartile of AUDIT-P PRS risk, the odds of being

diagnosed with alcohol dependence was 1.74 (95% confidence intervals (C.I.) = 1.47-2.07)

compared to those in the lowest quartile, while being in the top AUDIT-C PRS quartile was

associated with an odds ratio (OR) of 1.30 (95% C.I. = 1.10 -1.54) compared to those in the

lowest quartile (Figure 1). Comparison of the OR and their 95% C.I. suggest that for AUDIT-C,

those in the 4th quartile are at significantly elevated likelihood of AUD compared to those in

lowest quartile, though there were overlapping 95% C.I. between the 2nd, 3rd, and 4th quartiles.

In contrast, for AUDIT-P, likelihood of AUD escalates at the mid-point, with those in the both the

3rd and 4th quartiles being at significantly greater risk of AUD than those in the lowest quartile

of AUDIT-P PRS.13

Page 14: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

ALSPAC: Only AUDIT-P PRS were associated with AUDIT-P scores, accounting for 1.1% of the

variance (p = 9.05e-6). AUDIT-C PRS and AUDIT-P PRS were both positively associated with

DSM-IV alcohol dependence, explaining 1.0% (p = 4.81e-6) and 0.50% (p = 5.75e-4) of the

variance, respectively, although only the AUDIT-C PRS passed multiple testing corrections.

Both PRS scores were also positively associated with DSM-IV alcohol dependence symptom

count (R2 = 0.8% for both; AUDIT-C p = 3.59e-5, AUDIT-P p = 3.53e-5). For individuals in the

top quartile of AUDIT-C PRS, the odds of being diagnosed with AUD was 1.73 (95% C.I. = 1.29-

2.33) compared to those in the lowest quartile, while being in the top AUDIT-P PRS quartile was

associated with an OR of 1.25 (95% C.I. = 0.93 -1.68) compared to those in the lowest quartile

(Figure 1). Due to wider 95% C.I. (suggesting imprecision potentially due to smaller sample size

and/or less of a dose-response association pattern), it was difficult to draw comparisons of the

ORs across quartiles in the ALSPAC sample. For AUDIT-C, differentiation in likelihood of AUD

appeared to occur at the 3rd quartile, while for AUDIT-P, there was no clear distinction in the

OR across the 2nd, 3rd and 4th quartiles.

UKB: Positive associations with AUD-related ICD codes were observed for both AUDIT-P (R2 =

0.18%, p < 2e-16) and AUDIT-C (R2 = 0.04%, p = 8.9e-5) PRS. For individuals in the top quartile

of AUDIT-P PRS, the odds of having a mental or behavioral disorder due to alcohol was 1.45

(95% C.I. = 1.37-1.54) compared to those in the lowest quartile. For AUDIT-C PRS, being in the

top PRS quartile was associated with an odds-ratio of 1.11 (95% C.I. = 1.02-1.20) compared to

those in the lowest quartile (Figure 1). In the UKB, likelihood of AUD across quartiles seemed

to follow a dose-response pattern with increasing OR for each quartile of polygenic risk for both

AUDIT-C and AUDIT-P (though with overlapping C.I. for AUDIT-C), with the 4th quartile of

AUDIT-P PRS being statistically significantly more likely to be diagnosed with AUD relative to

the 3rd quartile.

14

Page 15: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Associations between AUDIT PRS and onset of hazardous drinking and alcohol

dependence

COGA: The AUDIT-P, but not the AUDIT-C, PRS was associated with increased hazards of

alcohol dependence onset (AUDIT-P HR = 1.15, 95% CI = 1.10-1.20, p = 4.7e-08;

Supplemental Table 13). Those in the top quartile of AUDIT-P PRS were at considerably

higher risk for onset of alcohol dependence (HR = 1.50, 95% CI = 1.36-1.64, p = 9.7e-10)

relative to any other group; the risk attributable to the top AUDIT-C PRS was lower in magnitude

(HR = 1.27, 95% CI = 1.14-1.40, p = 1.8e-4).

ALSPAC: Only the AUDIT-C PRS was associated with onset of hazardous drinking, although

not significant after multiple testing corrections (Supplemental Table 14; AUDIT-C HR = 1.06,

95% CI = 1.02-1.10, p = 2.28e-3).

DISCUSSION

The course of alcohol use and misuse consists of several stages, each with polygenic and

environmental underpinnings. At earlier ages, how much and how often a person drinks often

serves as an indicator of their genetic susceptibility to alcohol misuse (Dick et al., 2011; Irons et

al., 2015). As an individual progresses towards maladaptive alcohol intake, a unique genetic

susceptibility to physiological and psychological impairments related to alcohol misuse may

unfold (Kendler et al., 2011). Our goal was to outline the extent to which PRS derived from a

discovery GWAS of the AUDIT consumption (AUDIT-C) and problem (AUDIT-P) subscales

predicted variance in a range of alcohol use behaviors across samples that were variously

ascertained. With few exceptions, polygenic liability to AUDIT-P was a superior predictor of

indices of heavy drinking and alcohol problems compared to the AUDIT-C.

Overall, across several ascertainment strategies, AUDIT-P PRS outperformed AUDIT-C as a

predictor of both normative and disordered drinking (see Figure 2 for an overview). For 15

Page 16: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

example, AUDIT-P PRS predicted up to 3.3% of the variance in the lifetime maximum number of

drinks consumed in a 24-hour period (although this was a clear outlier in terms of variance

explained), 1.1% of the variance in AUDIT-P scores, and 0.7% of the variance in DSM-5 AUD

symptom counts. Exceptions to the superior performance of AUDIT-P PRS were observed

within ALSPAC, where AUDIT-C PRS explained higher proportions of variance than AUDIT-P

PRS in some adolescent AUDIT-C and AUDIT-P assessments (prior to age 23; see

Supplemental Materials). AUDIT-C PRS also accounted for more variance in DSM-IV AD in

ALSPAC.

Even indices of alcohol consumption, such as lifetime maximum drinks in 24 hours and monthly

alcohol use, were more closely related to the genetic liability indexed by AUDIT-P PRS than

AUDIT-C PRS. However, this difference was most notable in COGA, for which drinking for at

least 6 months during the lifetime represents a highly heterogeneous group of individuals, which

may include both non-problem drinkers and those with AUD. As expected, and in line with prior

studies (Mies et al., 2018; Sanchez-Roige et al., 2018), AUDIT-P outperformed AUDIT-C for

measures of problem drinking within each sample (i.e., COGA, ALSPAC, GS and UKB; see

Table 2), confirming that the AUDIT-P is more closely related to genetic susceptibility to alcohol

misuse than AUDIT-C.

Overall, the variance explained by PRS was low, even when the discovery and target

phenotypes were identical (e.g. AUDIT scores in ALSPAC). These estimates are nonetheless

consistent with other PRS studies (Mies et al., 2018; Savage et al., 2018), and the proportion of

explained variance may improve as the discovery GWAS get larger. In agreement with previous

studies (Savage et al., 2018), we demonstrated that ascertainment is likely to be a crucial factor

in polygenic prediction analyses. For instance, while the AUDIT-P PRS explained a larger

amount of variance in dependence in COGA than in ALSPAC, the AUDIT-C PRS was not a 16

Page 17: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

significant predictor for any phenotype in COGA over and above the effect of AUDIT-P PRS,

suggesting that the effect sizes for AUDIT-C from a volunteer cohort (UKB) may not be optimal

for capturing genetic risk to AUD in high-risk families, such as those in COGA. AUDIT-C

associations in COGA using other addiction-enriched samples, such as the Million Veteran

Program (Kranzler et al., 2019), where the genetic correlation between AUDIT-C and AUD is

high, will likely provide insights into the extent of the effect of ascertainment differences in

discovery and target samples. Interestingly, variance explained by either PRS was the highest

in ALSPAC, a population-based cohort that is noticeably younger than the discovery sample

(age 39-79 years). Lastly, the variance explained for AUD in the (independent) subset of UKB

participants itself was markedly lower than in ALSPAC. Here, it is worth recognizing that AUDIT

is a past-year screener for alcohol consumption/problems, and there may have been individuals

in the original AUDIT GWAS with low scores who were formerly problem drinkers. Furthermore,

using ICD codes derived from hospital records as a proxy for AUD in the UKB may have

resulted in false negatives; some of the 'controls' could have been problem drinkers but had not

been diagnosed with an alcohol related condition as a hospital inpatient. Thus, the lower

prediction in the UKB sample may reflect the instruments used to measure AUD in both the

discovery and target GWAS.

Our findings should be viewed in the context of several limitations. First, the AUDIT is a past-

year screener for alcohol consumption and problems, and thus may not capture individuals who

had alcohol use problems in the past. Second, the AUDIT discovery GWAS was performed in a

subset of the UKB that consisted of individuals who responded to an email request to complete

a mental health follow-up survey. A recent study of the genomic profiles of these individuals has

found them to represent lower polygenic risk for psychopathology and greater polygenic load for

educational achievement, suggesting selection bias in the discovery sample itself (Adams et al.,

2018). In addition, these individuals had relatively low endorsement of AUDIT-P items, which 17

Page 18: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

might have limited the statistical power of the discovery GWAS to capture the full spectrum of

problem drinking and, in turn, limited our ability to compare across the AUDIT-C and AUDIT-P.

Third, as the discovery GWAS was conducted on individuals of primarily European descent,

these PRS could not be applied to other ancestral groups, reflecting a research gap in the

dearth of large discovery GWAS of non-European populations (Martin et al., 2017). Results from

the recent AUDIT-C GWAS by Kranzler et al (2019), which includes 57,340 African-Americans,

for instance, will be instrumental in bridging this gap. Fourth, it is likely that the pattern of results

reflects a complex interplay between developmental effects, ascertainment strategy, the

psychometrics of the assessment instruments, and even population selection effects. For

instance, distinctions between ALSPAC and COGA could be attributed to the higher median age

of COGA participants, or even drinking patterns prevalent in earlier birth cohorts that are

represented within COGA. Potential moderation of PRS by sex are also possible; however,

power for such exploratory analyses is limited and beyond the scope of this study. Fifth,

although this study examined a range of alcohol use phenotypes (from a measure of monthly

alcohol use to dependence), the cross-sectional design cannot capture potential longitudinal

changes in PRS association across the progression of stages of alcohol use. A sixth limitation is

that we did not control for any measures of socioeconomic status (SES) in our models, although

SES has been shown to be a potential moderator of polygenic risk for alcohol consumption

(Clarke et al., 2016; Barr et al., 2018), as the measures available varied widely and

inconsistently across our target samples. The imbalance of cases and controls in the UK

Biobank target sample is another limitation; this imbalance would be more likely to seriously

bias a discovery GWAS, but may still have confounded the regression analyses in our target

sample. A final limitation is that the strength of PRS associations varied across p-value

thresholds (pT; see Supplementary Tables 1-12); because we chose the best-performing pT for

each model, as is typical for the pruning+thresholding (P+T) method of polygenic risk score

analyses (Euesden et al., 2014), we run the risk of over-fitting. 18

Page 19: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Collectively, these analyses demonstrate that much of the polygenic liability to alcohol use and

misuse remains unaccounted for – we anticipate that much larger GWAS of phenotypes

representing disordered drinking, in conjunction with other risk indices (e.g., PRS for negative

affect, a hallmark of later stages of AUD) will be necessary to explain additional variance.

Importantly, as the use of PRS becomes increasingly common in attempts to dissect the

transitions from experimentation, to regular or problem use and further, to AUD, differences

between the discovery sample and target sample(s) in age, ascertainment, and other

characteristics will need to be considered in the interpretation of findings. In addition, our

findings highlight the high degree of heterogeneity and polygenicity underlying alcohol use and

misuse – not only is there no “gene for” alcoholism, there is currently no robust “polygenic

indicator for” clinical prediction of liability to problem drinking. Furthermore, while we focused on

contrasting the genetic relationship of each individual AUDIT subscale with the variety of

alcohol-related measures across our different samples, it should be noted that the proportion of

variance explained in the alcohol-related outcomes is larger when we consider both the AUDIT-

C and AUDIT-P PRS together, suggesting that multiple polygenic scores provide more utility

than PRS for the individual sub-scales. The two scores are only modestly correlated and likely,

some of this commonality is due to variants in ADH1B that exert a relatively large effect. Overall,

our analyses suggest that even though it is far too small to be statistically significant, there is

some incremental contribution of variance from inclusion of the AUDIT-C. Despite these

caveats, our analyses demonstrate the feasibility of using a fast and simple screening

questionnaire, the AUDIT, that assesses both consumption and problem drinking, to understand

the course of alcohol use and misuse.

FINANCIAL DISCLOSURES

19

Page 20: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Dr. McIntosh has received research support from Eli Lilly, Janssen, Pfizer, and the Sackler

Foundation. The remaining authors report no potential conflicts of interest to disclose.

ACKNOWLEDGEMENTS This research was conducted using the UK Biobank Resource, application number 4844.

ECJ was supported by funding from NIAAA (1F32AA027435). S.S-R was supported by the

Frontiers of Innovation Scholars Program (FISP; #3-P3029), the Interdisciplinary Research

Fellowship in NeuroAIDS (IRFN; MH081482), a pilot award from the NIH (DA037844) and a

NARSAD Young Investigator Award from the Brain and Behavior Foundation (Grant Number

27676). SSR and AAP were supported by funds from the California Tobacco-Related Disease

Research Program (TRDRP; Grant Number 28IR-0070 and T29KT0526). AA receives support

from K02DA032573. VH receives support from P60AA03510. DMD acknowledges support from

K02AA018755. JLM is supported by K01DA037914.

ALSPAC: We are extremely grateful to all the families who took part in this study, the midwives

for their help in recruiting them, and the whole ALSPAC team, which includes interviewers,

computer and laboratory technicians, clerical workers, research scientists, volunteers,

managers, receptionists and nurses.

The UK Medical Research Council and Wellcome (Grant ref: 102215/2/13/2) and the University

of Bristol provide core support for ALSPAC. This publication is the work of the authors and

EMC, ACE, and AA will serve as guarantors for the contents of this paper. A comprehensive list

of grants funding is available on the ALSPAC website

(http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf).

20

Page 21: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

This research was specifically funded by NIH grants AA021399, AA018333, and AA022537.

ACE was further supported by a Young Investigator award from the Brain and Behavior

Foundation (Grant Number 24864). GWAS data were generated by Sample Logistics and

Genotyping Facilities at Wellcome Sanger Institute and LabCorp (Laboratory Corporation of

America) using support from 23andMe.

COGA: The Collaborative Study on the Genetics of Alcoholism (COGA), Principal Investigators

B. Porjesz, V. Hesselbrock, H. Edenberg, L. Bierut, includes eleven different centers: University

of Connecticut (V. Hesselbrock); Indiana University (H.J. Edenberg, J. Nurnberger Jr., T.

Foroud; Y. Liu); University of Iowa (S. Kuperman, J. Kramer); SUNY Downstate (B. Porjesz);

Washington University in St. Louis (L. Bierut, J. Rice, K. Bucholz, A. Agrawal); University of

California at San Diego (M. Schuckit); Rutgers University (J. Tischfield, A. Brooks); Department

of Biomedical and Health Informatics, The Children’s Hospital of Philadelphia; Department of

Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia PA (L.

Almasy), Virginia Commonwealth University (D. Dick), Icahn School of Medicine at Mount Sinai

(A. Goate), and Howard University (R. Taylor). Other COGA collaborators include: L. Bauer

(University of Connecticut); J. McClintick, L. Wetherill, X. Xuei, D. Lai, S. O’Connor, M. H.

Plawecki, S. Lourens (Indiana University); G. Chan (University of Iowa; University of

Connecticut); J. Meyers, D. Chorlian, C. Kamarajan, A. Pandey, J. Zhang (SUNY Downstate);

J.-C. Wang, M. Kapoor, S. Bertelsen (Icahn School of Medicine at Mount Sinai); A. Anokhin, V.

McCutcheon, S. Saccone (Washington University); J. Salvatore, F. Aliev, B. Cho (Virginia

Commonwealth University); and Mark Kos (University of Texas Rio Grande Valley). A. Parsian

and H. Chen are the NIAAA Staff Collaborators.

21

Page 22: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

We continue to be inspired by our memories of Henri Begleiter and Theodore Reich, founding PI

and Co-PI of COGA, and also owe a debt of gratitude to other past organizers of COGA,

including Ting-Kai Li, P. Michael Conneally, Raymond Crowe, and Wendy Reich, for their critical

contributions. This national collaborative study is supported by NIH Grant U10AA008401 from

the National Institute on Alcohol Abuse and Alcoholism (NIAAA) and the National Institute on

Drug Abuse (NIDA).

Generation Scotland: Generation Scotland received core support from the Chief Scientist

Office of the Scottish Government Health Directorates [CZD/16/6] and the Scottish Funding

Council [HR03006]. Genotyping of the GS:SFHS samples was carried out by the Genetics Core

Laboratory at the Wellcome Trust Clinical Research Facility, Edinburgh, Scotland and was

funded by the Medical Research Council UK and the Wellcome Trust (Wellcome Trust Strategic

Award “STratifying Resilience and Depression Longitudinally” (STRADL) Reference

104036/Z/14/Z). AMM received support from an MRC Mental Health Data Pathfinder Grant

(reference MC_PC_17209).

22

Page 23: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

REFERENCES

Adams, M., Hill, W. D., Howard, D. M., Davis, K. A. S., Deary, I. J., Hotopf, M., & McIntosh, A. M. (2018). Factors associated with sharing email information and mental health survey participation in two large population cohorts. BioRxiv, 471433. https://doi.org/10.1101/471433

American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM-5®). American Psychiatric Pub.

Barr, P. B. et al. (2018) ‘Childhood socioeconomic status and longitudinal patterns of alcohol problems: Variation across etiological pathways in genetic risk’, Social Science & Medicine. Elsevier, 209, pp. 51–58.Barton, K. (2011). MuMIn: multi-model inference. R package v. 1.6. 5.Begleiter, H., Reich, T., Hesselbrock, V., Porjesz, B., Li, T.-K., Schuckit, M. A., … Rice, J. P.

(1995). The collaborative study on the genetics of alcoholism. Alcohol Health and Research World, 19, 228.

Berg, J. J., Harpak, A., Sinnott-Armstrong, N., Joergensen, A. M., Mostafavi, H., Field, Y., … Coop, G. (2018). Reduced signal for polygenic adaptation of height in UK Biobank. BioRxiv, 354951. https://doi.org/10.1101/354951

Bucholz, K. K., Cadoret, R., Cloninger, C. R., Dinwiddie, S. H., Hesselbrock, V. M., Nurnberger, J. I., … Schuckit, M. A. (1994). A new, semi-structured psychiatric interview for use in genetic linkage studies: a report on the reliability of the SSAGA. Journal of Studies on Alcohol, 55(2), 149–158. https://doi.org/10.15288/jsa.1994.55.149

Bucholz, K. K., McCutcheon, V. V., Agrawal, A., Dick, D. M., Hesselbrock, V. M., Kramer, J. R., … Porjesz, B. (2017). Comparison of Parent, Peer, Psychiatric, and Cannabis Use Influences Across Stages of Offspring Alcohol Involvement: Evidence from the COGA Prospective Study. Alcoholism: Clinical and Experimental Research, 41(2), 359–368. https://doi.org/10.1111/acer.13293

Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., … Marchini, J. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203–209. https://doi.org/10.1038/s41586-018-0579-z

Chang, C. C., Chow, C. C., Tellier, L. C., Vattikuti, S., Purcell, S. M., & Lee, J. J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience, 4(1), 7. https://doi.org/10.1186/s13742-015-0047-8

Clarke, T. et al. (2016) ‘Polygenic risk for alcohol dependence associates with alcohol consumption, cognitive function and social deprivation in a population‐based cohort’, Addiction biology. Wiley Online Library, 21(2), pp. 469–480.Clarke, T.-K., Adams, M. J., Davies, G., Howard, D. M., Hall, L. S., Padmanabhan, S., …

McIntosh, A. M. (2017, July). Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N=112,117). Molecular Psychiatry. The Author(s). https://doi.org/10.1038/mp.2017.153

Consortium, T. 1000 G. P. (2015). A global reference for human genetic variation. Nature, 526(7571), 68–74.

Davis, K. A. S. et al. (2018) ‘Mental health in UK Biobank: development, implementation and results from an online questionnaire completed by 157 366 participants’, BJPsych open. Cambridge University Press, 4(3), pp. 83–90.Degenhardt, L., O’Loughlin, C., Swift, W., Romaniuk, H., Carlin, J., Coffey, C., … Patton, G.

(2013). The persistence of adolescent binge drinking into adulthood: findings from a 15-year prospective cohort study. BMJ Open, 3(8), e003015.

Dick, D. M., Meyers, J. L., Rose, R. J., Kaprio, J., & Kendler, K. S. (2011). Measures of current alcohol consumption and problems: two independent twin studies suggest a complex

23

Page 24: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

genetic architecture. Alcoholism: Clinical and Experimental Research, 35(12), 2152–2161.Edwards, A. C., Heron, J., Vladimirov, V., Wolen, A. R., Adkins, D. E., Aliev, F., … Kendler, K.

S. (2017). The Rate of Change in Alcohol Misuse Across Adolescence is Heritable. Alcoholism: Clinical and Experimental Research, 41(1), 57–64. https://doi.org/10.1111/acer.13262

ENOCH, M.-A. (2006). Genetic and Environmental Influences on the Development of Alcoholism. Annals of the New York Academy of Sciences, 1094(1), 193–201. https://doi.org/10.1196/annals.1376.019

Euesden, J., Lewis, C. M., & O’reilly, P. F. (2014). PRSice: polygenic risk score software. Bioinformatics, 31(9), 1466–1468.

Ewing, J. A. (1984). Detecting Alcoholism: The CAGE Questionnaire. JAMA: The Journal of the American Medical Association, 25(14), 1905–1907. https://doi.org/10.1001/jama.1984.03350140051025

Irons, D. E., Iacono, W. G., & McGue, M. (2015). Tests of the effects of adolescent early alcohol exposures on adult outcomes. Addiction (Abingdon, England), 110(2), 269–278. https://doi.org/10.1111/add.12747

Johnson, E. C., St.Pierre, C. L., Meyers, J. , Aliev, F. , McCutcheon, V. V., Lai, D. , Dick, D. M., Goate, A. M., Kramer, J. , Kuperman, S. , Nurnberger, J. I., Schuckit, M. A., Porjesz, B. , Edenberg, H. J., Bucholz, K. K. and Agrawal, A. (2019), The genetic relationship between alcohol consumption and aspects of problem drinking in an ascertained sample. Alcohol Clin Exp Re. Accepted Author Manuscript. doi:10.1111/acer.14064

Kassambara, A., Kosinski, M., & Biecek, P. (2017). survminer: Drawing Survival Curves using’ggplot2’. R Package Version 0.3, 1.

Kendler, K. S., Gardner, C., & Dick, D. M. (2011). Predicting alcohol consumption in adolescence from alcohol-specific and general externalizing genetic risk factors, key environmental exposures and their interaction. Psychological Medicine, 41(7), 1507–1516. https://doi.org/10.1017/S003329171000190X

Kendler, K. S., Heath, A. C., Neale, M. C., Kessler, R. C., & Eaves, L. J. (1992). A Population-Based Twin Study of Alcoholism in Women. JAMA, 268(14), 1877–1882. https://doi.org/10.1001/jama.1992.03490140085040

Kranzler, H. R., Zhou, H., Kember, R. L., Smith, R. V., Justice, A. C., Damrauer, S., ... & Gelernter, J. (2019). Genome-wide association study of alcohol consumption and use disorder in 274,424 individuals from multiple populations. Nature communications, 10(1), 1499.

Lee, S. H. et al. (2012) ‘A Better Coefficient of Determination for Genetic Profile Analysis’, Genetic Epidemiology. John Wiley & Sons, Ltd, 36(3), pp. 214–224. doi: 10.1002/gepi.21614.Liu, M., Jiang, Y., Wedow, R., Li, Y., Brazel, D. M., Chen, F., … Psychiatry, H. A.-I. (2019).

Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nature Genetics, 51(2), 237–244. https://doi.org/10.1038/s41588-018-0307-5

Marees, A., Smit, D., Ong, J., MacGregor, S., An, J., Denys, D., . . . Derks, E. (n.d.). Potential influence of socioeconomic status on genetic correlations between alcohol consumption measures and mental health. Psychological Medicine, 1-15. doi:10.1017/S0033291719000357

Martin, A. R., Gignoux, C. R., Walters, R. K., Wojcik, G. L., Neale, B. M., Gravel, S., … Kenny, E. E. (2017). Human demographic history impacts genetic risk prediction across diverse populations. The American Journal of Human Genetics, 100(4), 635–649.

McCutcheon, V. V, Grant, J. D., Heath, A. C., Bucholz, K. K., Sartor, C. E., Nelson, E. C., … Martin, N. G. (2012). Environmental influences predominate in remission from alcohol use disorder in young adult twins. Psychological Medicine, 42(11), 2421–2431.

Mies, G. W., Verweij, K. J. H., Treur, J. L., Ligthart, L., Fedko, I. O., Hottenga, J. J., … Vink, J. 24

Page 25: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

M. (2018). Polygenic risk for alcohol consumption and its association with alcohol-related phenotypes: Do stress and life satisfaction moderate these relationships? Drug and Alcohol Dependence, 183, 7–12.

Nakagawa, S., & Schielzeth, H. (2012). A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods in Ecology and Evolution, 4(2), 133–142. https://doi.org/10.1111/j.2041-210x.2012.00261.x

Navrady, L. B., Wolters, M. K., MacIntyre, D. J., Clarke, T. K., Campbell, A. I., Murray, A. D., … McIntosh, A. M. (2018). Cohort profile: Stratifying Resilience and Depression Longitudinally (STRADL): A questionnaire follow-up of Generation Scotland: Scottish Family Health Study (GS: SFHS). International Journal of Epidemiology, 47(1), 13–14g. https://doi.org/10.1093/ije/dyx115

Nurnberger, J. I., Wiegand, R., Bucholz, K., O’Connor, S., Meyer, E. T., Reich, T., ... & Bierut, L. (2004). A family study of alcohol dependence: coaggregation of multiple disorders in relatives of alcohol-dependent probands. Archives of general psychiatry, 61(12), 1246-1256.

Nyholt, D. R. (2004) ‘A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other’, The American Journal of Human Genetics. Elsevier, 74(4), pp. 765–769.Pagan, J. L., Rose, R. J., Viken, R. J., Pulkkinen, L., Kaprio, J., & Dick, D. M. (2006). Genetic

and environmental influences on stages of alcohol use across adolescence and into young adulthood. Behavior Genetics, 36(4), 483–497. https://doi.org/10.1007/s10519-006-9062-y

Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., … Sham, P. C. (2007). PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. The American Journal of Human Genetics. https://doi.org/10.1086/519795

R Core Team. (2017). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.

Sanchez-Roige, S., Fontanillas, P., Elson, S. L., Gray, J. C., de Wit, H., Davis, L. K., … Palmer, A. A. (2017). Genome-wide association study of alcohol use disorder identification test (AUDIT) scores in 20 328 research participants of European ancestry. Addiction Biology. https://doi.org/10.1111/adb.12574Sanchez-Roige, S., Palmer, A. A., Fontanillas, P., Elson, S. L., 23andMe Research Team,

Substance Use Disorder Working Group of the Psychiatric Genomics Consortium, … Clarke, T.-K. (2018). Genome-Wide Association Study Meta-Analysis of the Alcohol Use Disorders Identification Test (AUDIT) in Two Population-Based Cohorts. American Journal of Psychiatry. https://doi.org/https://doi.org/10.1176/appi.ajp.2018.18040369

Saunders, J. B., Aasland, O. G., Babor, T. F., De la Fuente, J. R., & Grant, M. (1993). Development of the alcohol use disorders identification test (AUDIT): WHO collaborative project on early detection of persons with harmful alcohol consumption‐II. Addiction, 88(6), 791–804.

Savage, J. E., Salvatore, J. E., Aliev, F., Edwards, A. C., Hickman, M., Kendler, K. S., … Kaprio, J. (2018). Polygenic Risk Score Prediction of Alcohol Dependence Symptoms Across Population‐Based and Clinically Ascertained Samples. Alcoholism: Clinical and Experimental Research, 42(3), 520–530.

Schuckit, M. A., Smith, T. L., Danko, G., Kramer, J., Bucholz, K. K., McCutcheon, V., ... & Hesselbrock, M. (2018). A 22‐Year Follow‐Up (Range 16 to 23) of Original Subjects with Baseline Alcohol Use Disorders from the Collaborative Study on Genetics of Alcoholism. Alcoholism: Clinical and Experimental Research, 42(9), 1704-1714.

Schumann, G., Liu, C., O’Reilly, P., Gao, H., Song, P., Xu, B., … Elliott, P. (2016). KLB is associated with alcohol drinking, and its gene product β-Klotho is necessary for FGF21

25

Page 26: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

regulation of alcohol preference. Proc Natl Acad Sci USA, 113(50), 14372–14377.Smith, B. H., Campbell, A., Linksted, P., Fitzpatrick, B., Jackson, C., Kerr, S. M., … Morris, A. D.

(2013). Cohort Profile: Generation Scotland: Scottish Family Health Study (GS:SFHS). The study, its participants and their potential for genetic research on health and illness. International Journal of Epidemiology. https://doi.org/10.1093/ije/dys084

Therneau, T. M. (2018). Package ‘coxme.’ Mixed Effects Cox Models. R Package Version, 2.Therneau, T. M., & Grambsch, P. M. (2013). Modeling survival data: extending the Cox model.

Springer Science & Business Media.Therneau, T. M., & Lumley, T. (2015). Package ‘survival.’ R Top Doc, 128.Trim, R. S., Schuckit, M. A., & Smith, T. L. (2013). Predictors of Initial and Sustained Remission

from Alcohol Use Disorders: Findings from the 30‐Year Follow‐Up of the S an D iego Prospective Study. Alcoholism: Clinical and Experimental Research, 37(8), 1424–1431.

Walters, R. K., Polimanti, R., Johnson, E. C., McClintick, J. N., Adams, M. J., Adkins, A. E., … Team, 23andMe Research. (2018). Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nature Neuroscience, 21(12), 1656–1669. https://doi.org/10.1038/s41593-018-0275-1

Wennberg, P., Andersson, T., & Bohman, M. (2000). Associations between different aspects of alcohol habits in adolescence, early adulthood, and early middle age: a prospective longitudinal study of a representative cohort of men and women. Psychology of Addictive Behaviors, 14(3), 303.

26

Page 27: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

TABLES

Table 1. Alcohol-related measures in each target sample. C, Continuous (linear regression model used); B, Binary (logistic regression model used)

Drinking measure Definitions

Trait Continuous/Binary

COGA Ascertained for AUD

ALSPACPopulation-based

GSPopulation

-based

UKBPopulation-

basedUnits per week

(Mean, SD)C n/a n/a 10.9 (12.8) n/a

Alcohol use(N, %)

B

Ever drank once a month for 6

consecutive months or longer

6,163 (90.0%)

n/a n/a n/a

Maxdrinks(Median, SD)

C

Lifetime maximum drinks consumed in a single 24-hour period

14.0 (12.87)

Lifetime maximum drinks consumed in a single

24-hour period15.0 (12.35)

n/a n/a

Hazardous drinking (N, %)

B n/aAUDIT total score cutoff

of 8+ at any age3264 (66.2%)

n/a n/a

AUDIT-C score(Mean, SD)

C n/a

Sum of AUDIT items 1-3Age 16 4.80 (2.59)Age 18 5.04 (2.26)Age 19 6.01 (2.50)Age 21 6.11 (2.43)Age 23 5.37 (2.31)

n/a n/a

AUDIT-P score (Mean, SD)

C n/a

Sum of AUDIT items 4-10

Age 16 2.37 (3.18)Age 18 2.48 (2.97)Age 19 2.38 (2.89)Age 21 3.16 (3.66)Age 23 2.30 (2.93)

n/a n/a

DSM alcohol dependence

symptom count(Mean, SD)

C

Sum of 11 lifetime criteria for diagnosis of

DSM-5 AUD3.57 (3.68)

Sum of 7 criteria for diagnosis of DSM-IV

AUD; maximum observed across waves

Age 18 0.35 (0.91)Age 21 0.72 (1.30)Age 23 0.43 (1.01)

n/a n/a

DSM-IV alcohol dependence

diagnosisB

Clustering of ≥3 of 7 DSM-IV criteria within

12 months

Clustering of ≥3 of 7 DSM-IV criteria within 12 months at any wave of

n/a n/a

27

Page 28: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

(N, %) 2,318 (33.8%)assessment (age 18, 21,

or 23) 484 (11.2%)

ICD-9 and ICD-10 alcohol

dependence diagnosis (N, %)

B n/a n/a n/a

ICD-9 code of 303* or ICD-10 code of F10* 4,141

(1.68%)CAGE scores(Range 0-4; Mean, SD)

C n/a n/a 0.603 (0.93) n/a

28

Page 29: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Table 2. Associations between AUDIT-C and AUDIT-P PRS and multiple indices of alcohol use in COGA, UKB, ALSPAC and GS samples.

The PRS threshold (pT) that most strongly predicted each outcome (based on highest R2) is presented. Results significant at p < 0.0004 are bolded.

Complete results are available in Supplemental Tables 1-12.

AUDIT-C PRS AUDIT-P PRS

Outcome Sample pT R2 Beta (SE) p-value pT R2 Beta (SE) p-value

Units per Week GS <0.1 0.27%

0.052 (0.008) 5.50E-11 <0.01

0.15%

0.039 (0.008) 4.70E-07

Alcohol use COGA <0.0001 0.32%

0.129 (0.046) 5.19E-03 <0.4 0.65%

0.188 (0.049) 1.24E-04

AUDIT-C ALSPAC (age 23) <0.1 0.59%

0.066 (0.020) 1.06E-03 <0.01

0.65%

0.067 (0.020) 5.90E-04

Max Drinks ALSPAC <0.5 1.54%

0.389 (0.173) 2.44E-02 <0.2 3.26%

0.540 (0.171) 1.59E-03

COGA < 0.001 0.04%

0.266 (0.147) 7.03E-02 < 0.2

0.47%

0.879 (0.156) 2.00E-08

CAGE GS <0.4 0.19%

0.044 (0.010) 8.60E-04 <0.2 0.40%

0.063 (0.010) 9.00E-07

AUDIT-P ALSPAC (age 23) <0.2 0.28%

0.045 (0.020) 2.61E-02 <0.01

1.12%

0.088 (0.020) 9.05E-06

Alcohol Dependence Symptom Count

ALSPAC (DSM-IV) <0.3 0.84%

0.091 (0.022) 3.59E-05 <0.01

0.84%

0.086 (0.021) 3.53E-05

COGA (DSM-5) < 0.1 0.08%

0.103 (0.045)

2.30E-02 < 0.4

0.67%

0.301 (0.046) 1.01E-10

Alcohol Dependence ALSPAC (DSM-IV) <0.3 0.96%

0.240 (0.052) 4.81E-06 <0.01

0.54%

0.170 (0.050) 5.75E-04

COGA (DSM-IV) <0.0001 0.14%

0.078 (0.032) 1.44E-02 <0.4 0.68%

0.199 (0.034) 4.55E-09

UKB (ICD) <0.0001 0.04%

0.062 (0.016) 8.90E-05 <0.5 0.18%

0.147 (0.020) < 2.00E-16

29

Page 30: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

30

Page 31: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

31

Page 32: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Figure 1. Odds Ratios (OR) and 95% confidence interval for alcohol dependence

diagnosis by PRS quartiles in ALSPAC, COGA, and UKB. PRS were split into quartiles and

odds ratios calculated for case status for each quartile of risk compared to quartile 1 (lowest). A.

Alcohol dependence in ALSPAC was coded by DSM-IV diagnosis (484 cases, 3,837 controls).

B. Alcohol dependence was coded via DSM-IV diagnosis in the COGA sample (2,318 cases,

4,532 controls). C. A sample of European ancestry, unrelated British individuals who had ever

drank, had ICD9 and ICD-10 codes available, and were not included in the discovery GWAS

were used (4,141 cases, 241,806 controls).

32

Page 33: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Figure 2. Overview of the magnitude of predicted variance (R2, %) across the alcohol-

related phenotypes probed in the four independent samples (Generation Scotland, GS;

ALSPAC at age 23, A23; Collaborative Study on the Genetics of Alcoholism, COGA; and

UK Biobank, UKB). The color of the dots denotes AUDIT-C (light gray) or AUDIT-P (dark gray)

PRS. Only the significant (p < 0.0004) associations are shown. C, AUDIT-C; P, AUDIT-P.

33

Page 34: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Supplemental Material

Materials and Methods

Samples

UK Biobank - Discovery GWASThe AUDIT was administered to assess alcohol use over the past year. After quality control procedures were performed to remove participants with missing data, and keeping only white British unrelated individuals, 121,604 individuals with AUDIT total scores were available. AUDIT total score was created by taking the sum of items 1–10, for all participants, including those who endorsed currently never drinking alcohol (as they could still endorse past hazardous use on items 9 and 10). The AUDIT subdomain scores were created by aggregating the scores from items 1–3, which include the information pertaining to alcohol consumption (AUDIT-C, N=121,604), and from items 4–10, which index the information pertaining to alcohol problems (AUDIT-P, N=121,604). These traits were log10-transformed to approximate a normal distribution. See (Sanchez-Roige et al., 2018) for more details.

Target samples

COGAThe Collaborative Study on the Genetics of Alcoholism (COGA) was established to examine familial underpinnings of alcohol use disorders and related behaviors (Begleiter et al., 1995; Bucholz et al., 2017). Alcohol dependent probands were ascertained from inpatient or outpatient treatment facilities across 7 US sites. Subsequently, family members of the probands were also invited to participate. Community individuals were selected from a variety of sources (e.g., dental clinics) and their family members were similarly invited to participate. A proportion of the families in COGA are large and have a high density of alcoholism. The Institutional Review Boards at all sites approved this study and written consent was obtained from all participants. As the PRS were derived from a sample of Europeans, only individuals identified as part of the European American subsample of COGA, determined using genomic data, were included in the analyses reported here (N = 7,645). To avoid confounding by individuals at high genetic risk who elected not to drink due to personal (e.g., religious, cultural, health) reasons, we also excluded individuals reporting no lifetime use of alcohol (N = 335). In addition, we excluded those aged 12-19 (N = 460) as they may not be past the period of maximal risk for onset of alcohol use and problems, and thus may represent censored observations, yielding a final analysis N = 6,850.

ALSPACThere were 14,541 initial pregnancies for which the mothers enrolled in the Avon Longitudinal Study of Parents and Children (ALSPAC) study and had either returned at least 1 questionnaire or attended a “Children in Focus” clinic by July 19, 1999. Of these initial pregnancies, there was a total of 14,062 live births and 13,988 children who were alive at 1 year of age. Subsequent

34

Page 35: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

phases of enrollment increased the sample size over time (Boyd et al., 2013; Fraser et al., 2013). The total sample size for analyses using any data collected after the age of seven is 15,247 pregnancies, resulting in 15,458 foetuses. Of this total sample of 15,656 foetuses, 14,973 were live births and 14,899 were alive at 1 year of age.The phases of enrollment are described in more detail elsewhere (Boyd et al., 2013; Fraser et al., 2013). Only offspring genotypes were used in the current analyses. The study website contains details of all the data that is available through a fully searchable data dictionary (http://www.bristol.ac.uk/alspac/researchers/our-data/). Beginning with the age 22 assessment, online questionnaires were administered using REDCap (Harris et al., 2009). Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees. Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time.

Generation ScotlandThe Scottish Family Health Study (GS) is a family-based cohort recruited from the general population of Scotland from 2006-2011 (N=24,084). Alcohol consumption was assessed using a preclinical questionnaire and participants were identified as current drinkers, former drinkers or never drinkers. Alcohol intake was self-reported as units consumed in the previous week. After removing former and never drinkers, there were 17,461 individuals with alcohol intake information and genotype data. The 4-item CAGE questionnaire was administered during a re-contact of GS in 2015 (Smith et al., 2013); this questionnaire was originally designed as a screening tool to measure alcohol problems (Ewing, 1984). After removing former and never drinkers there were 6,906 individuals available for analysis with both CAGE data and genotype data. This study obtained informed consent from all participants and was conducted under generic approval from the National Health Service National Research Ethics Service (approval letter dated 17 June 2011, Ref 11/NW/0382). All components of GS have received ethical approval from the National Health Service Tayside Committee on Medical Research Ethics (REC Reference Number: 05/S1401/89) and written consent for the use of data was obtained from all participants.

UKBA White British unrelated subset of the UK Biobank was used in the present study after any individuals who were part of the original discovery GWAS (Sanchez-Roige et al., 2018) were removed from the sample. ICD-9 and ICD-10 codes derived from hospital in-patient stays were used to create ‘cases’. We used F10 ICD-10 codes (F10.1 - F10.9) ‘Mental and Behavioural Disorders due to the use of alcohol’ and 303.9 (Other and unspecified alcohol dependence) and 305.0 (Non-dependent alcohol abuse) ICD-9 codes. If UKB participants had been assigned any of these ICD-9 or ICD-10 codes as a primary or secondary diagnosis as a hospital in-patient then they were assigned case status. All other individuals were assigned control status. Controls were further refined to exclude never drinkers and also removing individuals who had identified as ever being ‘physically dependent on alcohol’ or ‘addicted to alcohol’. This resulted in a final sample size of 4,141 cases and 241,806 controls.

35

Page 36: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Alcohol-related measures

Units per week (GS only)Units per week was a self-report measure ascertained using a pre-clinical questionnaire where participants were asked to report on their average weekly alcohol intake. They were prompted with a picture of typical drink sizes and the units contained in each drink.

Alcohol use (COGA only)Participants were asked whether there was a period of time when they were drinking at least one drink a month for 6 or more consecutive months.

MaxDrinksIn both COGA and ALSPAC, participants were asked to report the highest number of standard drinks they had ever consumed in a 24-hour period. A standard US drink contains 14 grams of ethanol, whereas a standard UK drink contains 8 grams of ethanol. Participants in both samples were administered this item across multiple waves; the highest reported MaxDrinks reported by each individual was used as the phenotype of interest.

AUDIT scores (ALSPAC only)ALSPAC participants were administered the full 10-item AUDIT questionnaire at ages 16, 18, 19, 21, and 23. At each wave, participants were asked to consider their alcohol use within the past year. AUDIT-C scores correspond to the scores from items 1-3. AUDIT-P scores correspond to scores from items 4-10. Scores on the total AUDIT were used to derive the hazardous drinking measure: A score of 8 or higher is coded as 1; scores <8 are coded as 0. Scores were derived using the code recommended in (Organization, 2001). Those who reported lifetime abstinence were coded missing for all AUDIT-related outcomes.

CAGE scores (GS only)The CAGE questionnaire is a screener for problem drinking and consists of 4 items assessing a need to Cut down on drinking, Annoyance or criticism from others regarding drinking, Guilt about drinking, and needing an Eye-opener or drink first thing in the morning (Ewing, 1984). Binary responses across the items were summed, and former and never drinkers were excluded. The CAGE was a self-administered questionnaire used as part of a larger GS follow-up.

36

Page 37: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Alcohol dependence-related outcomes

COGA

Alcohol dependence symptoms and diagnosis were assessed in COGA individuals using the Semi-Structured Interview for the Genetics of Alcoholism (SSAGA; (Bucholz et al., 1994; Hesselbrock et al., 1999)), which includes diagnostic assessments for DSM-IV and DSM-5 alcohol use disorder (AUD). The highest sum of DSM-5 AUD symptom count across available assessments was used as the DSM-5 AUD symptom count outcome in analyses. If an individual had ever endorsed 3 or more DSM-IV alcohol dependence criteria at any assessment, they were coded as 1 for DSM-IV AD; otherwise they were coded as 0.

ALSPAC

Alcohol dependence symptoms were assessed in ALSPAC participants at ages 18, 21, and 23, using items that correspond to DSM-IV criteria, adapted from those described in (Boyd et al., 2013). Items instructed participants to reflect on their alcohol use and related behaviors over the previous year. These items were summed to create a symptom count (DSM-IV AD symptom count). The highest symptom count within a wave was selected as the outcome. If a participant had ever endorsed 3 or more criteria within a wave, they were coded as 1 for DSM-IV alcohol dependence (DSM-IV AD); otherwise they were coded 0. Those who reported lifetime abstinence were coded missing.

UK Biobank

ICD-9 and ICD-10 codes derived from hospital in-patient stays were used to create ‘cases’. We used F10 ICD-10 codes (F10.1 - F10.9) ‘Mental and Behavioural Disorders due to the use of alcohol’ and 303.9 (Other and unspecified alcohol dependence) and 305.0 (Non-dependent alcohol abuse) ICD-9 codes.

Genotyping, quality control and imputation

Discovery GWAS of AUDIT scoresGenotype imputation was performed on 487,409 individuals by the UK Biobank team, using IMPUTE4 (Bycroft et al., 2018) and the Haplotype Reference Consortium reference panel. After quality control, 16,213,998 single-nucleotide polymorphisms (SNPs) remained for GWAS analyses. GWAS analyses were performed using BGENIE (Bycroft et al., 2018), version 1.1. See (Sanchez-Roige et al., 2018) for further details.

COGAGenotyping for the COGA European American participants was performed using the Illumina 1M, Illumina OmniExpress, and Illumina 2.5M (Illumina, San Diego, CA), and Smokescreen (BioRelm, Walnut, CA) arrays. Array type was included as a covariate in all analyses. A pruned set of 47,000 variants that were genotyped on all platforms and had minor allele frequencies (MAF) > 10% in the combined samples, Hardy-Weinberg Equilibrium (HWE) p-values > 0.001,

37

Page 38: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

missing rates < 2%, and were not in linkage disequilibrium (LD, defined as R2< 0.5) were used to assess reported pedigree structure using identity-by-descent calculations in PLINK (Purcell et al., 2007). Family structures were altered as needed and SNP genotypes were tested for Mendelian inconsistencies with the revised family structure (O’Connell & Weeks, 1998). Genotype inconsistencies were set to missing. Imputation was to 1000 Genomes (EUR and AFR, Phase 3, b37, October 2014; build hg19) using SHAPEIT2 (Delaneau et al., 2011) and then Minimac3 (Das et al., 2016). Imputed SNPs were then converted to best guess genotypes. Imputed SNPs with information (INFO) scores < 0.30 or individual genotype probability scores < 0.90 were excluded, as were palindromic SNPs (A/T or C/G), monomorphic SNPs, SNPs with a genotyping rate of < 95%, SNPs that did not pass Hardy-Weinberg equilibrium (HWE p < 1e-6), and SNPs with a minor allele frequency (MAF) less than 0.05%. In total, 6,881,872 SNPs passed quality control and data cleaning thresholds and were available for PRS analysis.

ALSPACGenotyping in ALSPAC was performed on the Illumina HumanHap550 quad genome-wide SNP genotyping platform by 23andMe subcontracting the Wellcome Trust Sanger Institute, Cambridge, UK, and the Laboratory Corporation of America, Burlington, NC, USA. Individuals were excluded from analyses on the basis of excessive or minimal heterozygosity, gender mismatch, individual missingness (0.3%), cryptic relatedness as measured by identity by descent (genome-wide IBD 0.10%) and sample duplication. Individuals were assessed for population stratification using multi-dimensional scaling modelling seeded with HapMap Phase II release 22 reference populations. Individuals of non-European ancestry were removed from further analysis. ShapeIt v2 was used to impute to 1000 Genomes Phase 1, Version 3, Release December 2013. We excluded markers with MAF<0.01, deviation from HWE (p < 5 x 10-6), genotyping rate <0.95, or INFO <0.80. Polygenic risk scores were derived using hard-called data in order to be consistent with samples in which PRSice was used.

Generation ScotlandGenotyping was performed using the Illumina OmniExpress BeadChip and after quality control which removed SNPs with a call rate <98%, a minor allele frequency (MAF) of <1% or those showing deviation from HWE (p < 5 x 10-6), 561,125 autosomal SNPs and 19,904 individuals were available for analysis.

UKBWe removed 131,790 related individuals who were third degree relatives or closer (using a kinship coefficient > 0.044). From these related individuals we identified one individual from each group of relatives by creating a genomic relationship matrix and using a genetic-relatedness cut-off of 0.025 and added these back into the sample (N = 55,745). Variants were removed with a call rate < 98%, a minor allele frequency < 0.001 and those that deviated significantly from Hardy-Weinberg equilibrium (p ≤ 5 x 10-6). Polygenic risk scores were generated in the non-imputed genotype data for this target sample, due to the computational burden of creating polygenic scores with imputed data on >250,000 individuals (see Ware et al., 2019 for a recent preprint showing that PRS derived from directly genotyped SNPs are comparable to PRS derived from imputed SNPs.)

38

Page 39: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Supplemental Results

AUDIT outcomes across adolescence in ALSPACThe primary text focuses on age 23 AUDIT outcomes in ALSPAC. Parallel analyses were run for AUDIT-C and AUDIT-P scores at ages 16, 18, 19, and 21 (full results in Supplementary Tables 8 and 9).

AUDIT-C scores. Focusing first on the top and middle rows of Supplementary Figure 1, AUDIT-C PRS overall accounted for more variance in AUDIT-C scores from age 16-21 (relative to AUDIT-P PRS). AUDIT-C score variance accounted for by AUDIT-C PRS increased across adolescence into early adulthood, peaking at age 21, then declined. With the exception of a drop at age 21, AUDIT-P PRS gradually explained an increasing proportion of AUDIT-C score variance across time. As noted in the primary text, at age 23, AUDIT-P PRS better predicted AUDIT-C scores.

AUDIT-P scores. Variance accounted for in AUDIT-P scores by AUDIT-C PRS increased modestly across ages, with a peak at age 21 before declining at age 23. The increase in R2 across time was steadier and more pronounced when AUDIT-P PRS was the predictor, with the exception of age 21.

For both AUDIT-C and AUDIT-P outcomes, a higher variance was accounted for by PRS as the sample aged, approximately tripling from age 16 to 23 (bottom row of the Supplementary Figure 1).

39

Page 40: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

References

Begleiter, H., Reich, T., Hesselbrock, V., Porjesz, B., Li, T.-K., Schuckit, M. A., … Rice, J. P. (1995). The collaborative study on the genetics of alcoholism. Alcohol Health and Research World, 19, 228.

Boyd, A., Golding, J., Macleod, J., Lawlor, D. A., Fraser, A., Henderson, J., … Smith, G. D. (2013). Cohort profile: The ’Children of the 90s’-The index offspring of the avon longitudinal study of parents and children. International Journal of Epidemiology. https://doi.org/10.1093/ije/dys064

Bucholz, K. K., Cadoret, R., Cloninger, C. R., Dinwiddie, S. H., Hesselbrock, V. M., Nurnberger, J. I., … Schuckit, M. A. (1994). A new, semi-structured psychiatric interview for use in genetic linkage studies: a report on the reliability of the SSAGA. Journal of Studies on Alcohol, 55(2), 149–158. https://doi.org/10.15288/jsa.1994.55.149

Bucholz, K. K., McCutcheon, V. V., Agrawal, A., Dick, D. M., Hesselbrock, V. M., Kramer, J. R., … Porjesz, B. (2017). Comparison of Parent, Peer, Psychiatric, and Cannabis Use Influences Across Stages of Offspring Alcohol Involvement: Evidence from the COGA Prospective Study. Alcoholism: Clinical and Experimental Research, 41(2), 359–368. https://doi.org/10.1111/acer.13293

Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., … Marchini, J. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203–209. https://doi.org/10.1038/s41586-018-0579-z

Das, S., Forer, L., Schönherr, S., Sidore, C., Locke, A. E., Kwong, A., … Fuchsberger, C. (2016). Next-generation genotype imputation service and methods. Nature Genetics, 48, 1284.

Delaneau, O., Marchini, J., & Zagury, J.-F. (2011). A linear complexity phasing method for thousands of genomes. Nature Methods. https://doi.org/10.1038/nmeth.1785

Ewing, J. A. (1984). Detecting Alcoholism: The CAGE Questionnaire. JAMA: The Journal of the American Medical Association, 25(14), 1905–1907. https://doi.org/10.1001/jama.1984.03350140051025

Fraser, A., Macdonald-wallis, C., Tilling, K., Boyd, A., Golding, J., Davey smith, G., … Lawlor, D. A. (2013). Cohort profile: The avon longitudinal study of parents and children: ALSPAC mothers cohort. International Journal of Epidemiology. https://doi.org/10.1093/ije/dys066

Harris, P.A., Taylor, R., Thielke, R., Payne, J., Gonzalez, N., Conde, J.G. Research electronic data capture (REDCap) – A metadata-driven methodology and workflow process for providing translational research informatics support, J Biomed Inform. 2009 Apr;42(2):377-81.

Hesselbrock, M., Easton, C., Bucholz, K. K., Schuckit, M., & Hesselbrock, V. (1999). A validity study of the SSAGA - A comparison with the SCAN. Addiction.

McCutcheon, V. V, Schuckit, M. A., Kramer, J. R., Chan, G., Edenberg, H. J., Smith, T. L., … Bucholz, K. K. (2017). Familial association of abstinent remission from alcohol use disorder in first-degree relatives of alcohol-dependent treatment-seeking probands. Addiction, 112(11), 1909–1917. https://doi.org/10.1111/add.13890

O’Connell, J. R., & Weeks, D. E. (1998). PedCheck: A Program for Identification of Genotype Incompatibilities in Linkage Analysis. The American Journal of Human Genetics, 63(1), 259–266. https://doi.org/10.1086/301904

Organization, W. H. (2001). AUDIT: The alcohol use disorders identification test: Guidelines for use in primary health care. Geneva: World Health Organization.

Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., … Sham, P. C. (2007). PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. The American Journal of Human Genetics.

40

Page 41: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

https://doi.org/10.1086/519795Sanchez-Roige, S., Palmer, A. A., Fontanillas, P., Elson, S. L., 23andMe Research Team,

Substance Use Disorder Working Group of the Psychiatric Genomics Consortium, … Clarke, T.-K. (2018). Genome-Wide Association Study Meta-Analysis of the Alcohol Use Disorders Identification Test (AUDIT) in Two Population-Based Cohorts. American Journal of Psychiatry. https://doi.org/https://doi.org/10.1176/appi.ajp.2018.18040369

Smith, B. H., Campbell, A., Linksted, P., Fitzpatrick, B., Jackson, C., Kerr, S. M., … Morris, A. D. (2013). Cohort Profile: Generation Scotland: Scottish Family Health Study (GS:SFHS). The study, its participants and their potential for genetic research on health and illness. International Journal of Epidemiology. https://doi.org/10.1093/ije/dys084

Ware, E.B.,  Schmitz, L.L.,  Faul, J., Gard, A., Mitchell, C., Smith, J.A., Zhao W., Weir, D., Kardia, S.L.R., bioRxiv 106062; doi: https://doi.org/10.1101/106062

41

Page 42: €¦  · Web viewTitle: Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Authors: Emma C. Johnson1*,

Supplementary Figure 1. Nagelkerke’s pseudo-R2 results are plotted for AUDIT-C and AUDIT-P outcomes in ALSPAC. Each column represents the age at which participants completed the AUDIT assessment. The columns are further divided into two batches of bars; the left batch within a column represents R2 values for the AUDIT-C outcome and the right batch within a column represents the AUDIT-P outcome. The color of each bar corresponds to the p-value threshold of the PRS used as predictor(s) (see legend). Each row corresponds to the PRS predictor of interest: R2 change when AUDIT-C PRS is dropped from the joint model (top row), R2 change when AUDIT-P PRS is dropped from the joint model (middle row), and R2 change when both PRS are dropped from the model. The top- and left-most batch of bars corresponds to R2 accounted for by AUDIT-C PRS in a model predicting AUDIT-C scores at age 16; the bottom- and right-most batch of bars corresponds to the R2 accounted for by both AUDIT-C and AUDIT-P PRS when predicting AUDIT-P scores at age 23.

42